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High-Performaiice  Computing  in  Europe 

by  Miroslaw  Malak.  Dr,  Matek  was  the  Liaison  Scientist  for  Computer  Science  and 
Computer  Engineering  in  the  Office  of  Navai  Research  European  Office  white  on  leave 
from  the  University  of  Texas  at  Austin,  where  he  hjids  the  Southwestern  Belt 
Professorship  in  Engineering  in  the  Department  of  Electrical  and  Computer  Engineer¬ 
ing. 


EXECUTIVE  SUMMARY 

A  wide  spectrum  of  activity  is  occurring  in 
high-performance  conq)uting  in  Europe.  It  ranges 
from  advanced  microprocessor  and  memory  tech¬ 
nology  to  system  design,  formal  methods,  and 
software.  In  addition  to  key  players  in  France,  the 
United  Kingdom  (U.K.),  and  the  Federal  Republic 
of  Germany  (FRG),  there  are  pockets  of  excellence 
in  The  Netherlands,  Italy,  Belgium,  Sweden,  Sv/it- 
zerland,  Finland,  and  other  countries. 

European  efforts  have  been  accelerated  by  sev¬ 
eral  high-profile  European  Community  (EC)  sci¬ 
ence  and  technology  programs.  Main  endeavors 
include  the  European  Strategic  Programme  for 
Research  and  Development  in  Advanced  Communi¬ 
cation  Technologies  in  Europe  (RACE)  and  the 
Joint  European  Submicron  Semiconductor  Initiative 
(JESSI),  which  is  partially  supported  by  the  Euro¬ 
pean  Ri^earch  Coordination  Agency  (EUREKA) 
and  the  European  Strategic  Programme  for  Re¬ 
search  and  Development  in  Formation  Technolo¬ 
gies  (ESPRIT). 

ESPRIT  is  in  its  third  3-year  phase  and  sup¬ 
ports  a  broad  spectrum  of  R&D  efrorts  in  conq)uter 
science  and  conq>uter  engineering  at  a  level  of 
about  $10  billion  during  1985-95.  The  focus  is  on 
four  main  research  areas: 

•  Microelectronics 

•  Software  Engineering  and 
Information  Processing  Systems 

•  Advanced  Business  and  Home 
Systems;  Peripherals 

•  Computer-Integrated 
Manufacturing  and  Engineering. 


In  addition,  there  is  one  general  program  (Basic 
Research  Actions)  and  six  large-scale  targeted  pro¬ 
jects.  These  projects  are: 

PO.  Open  Microprocessor  Systems 
Initiative 

PI.  Microelectronics 

P2.  Peripherals 

P3.  European  Software  and  Systems 
Initiative  (ESSI) 

P4.  High-Performance  Computing 

P5.  Computer-Integrated 
Manufacturing. 

The  primary  goals  of  the  new  3-year  phase  in¬ 
clude: 

•  providing  state-of-the-art  application-specif¬ 
ic  integrated  circuits  (ASICS)  manufacturing 
capability; 

•  developing  next-generation  computer  tech¬ 
nology;  and 

•  integrating  multiple  media  over  heteroge¬ 
neous  computer  systems  in  a  wide  range  of 
applications  environments. 

In  parallel  computing,  the  Inmos  transputer  has 
been  a  backbone  of  several  projects.  Inmos,  after 
being  acquired  by  SGS-Thomson  in  1989,  is  ag¬ 
gressively  pursuing  the  development  of  powerful 
microprocessors  and  communication  chips.  The 
recently  announced  T90()0  microprocessor  will 
deliver  a  150  MIPS/20  MFLOPS  peak  performance 
and  should  be  available  this  year. 

ESPRIT  already  has  a  number  of  parallel 
computing  efforts  that  focus  on  architecture 
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algorithms  and  the  education,  development,  and 
management  of  parallel  and  distributed  systems.  In 
the  post-RISC  Supemode-2  microprocessor  initia¬ 
tive,  Meiko  and  Inmos  will  be  leading  a  $120 
million  development  to  create  next-generation 
general-purpose  parallel  computers  based  on  new 
transputers.  Also,  S5  European  universities  have 
acquired  supemode  systems  to  experiment  with 
parallel  software  (applications,  languages,  and 
operating  systems). 

RACE  focuses  on  broadband  communications, 
with  emphasis  on  integrated-services  digital  net¬ 
works  (ISDN)  and  was  funded  at  about  $660  mil¬ 
lions  for  five  years— from  1987  to  1992.  With  a 
variety  of  switching  systems,  services,  communica¬ 
tion  organizations,  and  a  myriad  of  different  regu¬ 
lations,  RACE  will  concentrate  on  standardization 
and  the  development  of  a  unified  ISDN  to  provide 
cheap  and  reliable  communications  in  Europe. 
Special  projects  range  from  network  management 
to  di*!tributed  design  and  the  manufacturing  of 
multilayer  printed  circuit  boards.  Other  projects 
include  distributed  multimedia  publishing, 
multigigabit  network  projects,  telecommunications 
software,  multimedia  architecmres,  and  network 
management.  Within  the  EC  atone,  11  telecommu¬ 
nication  administrations,  89  universities,  and  more 
than  230  companies  are  involved  in  tne  RACE 
program. 

The  EUREKA  program,  proposed  by  President 
Mitterand  in  1985  in  response  to  the  U.S.  Strategic 
Defense  Initiative  (SDI),  concentrates  on  informa¬ 
tion  technology  (25  percent),  manufacturing  (17.5 
percent)  biotechnology  (13  percent),  and  materials 
(12  percent)  but  is  available  to  practically  any  area 
of  science  that  will  contribute  to  technological 
progress.  Almost  300  projects  from  19  countries 
are  funded  for  a  total  of  almost  $8.5  billion.  Key 
projects  include  High  Definition  Television 
(HDTV)  and  JESSI,  for  which  only  partial  support 
comes  EUREKA. 

The  JESSI  program  is  a  Japanese-style,  gov- 
emment-industry-academia  effort  geared  toward 
making  Europe  indepemlent  in  semiconductor 
technology  by  the  year  2000.  The  program  initial¬ 
ly  was  scheduled  for  eight  years  (1987-95), with 
anticipated  level  of  funding  expected  to  be  $8 
billion  (with  the  EC  governments  contributing 
more  than  $4.5  billion).  Major  companies  such  as 
Siemens,  SGS-Thomson,  and  Philips  are  involved. 


After  much  deliberation,  IBM  was  allowed  to 
join  in  1991  because  of  IBM’s  involvement  in 
Sematech,  a  U.S.  multicompany  semiconductor 
consortium  established  in  Austin,  Texas.  In  addi¬ 
tion,  IBM  and  Siemens  signed  a  joint  agreement  to 
develop  a  64  Mbit  random  access  memory  (RAM) 
chip.  JESSI’s  future  is  'incertain;  there  are  ques¬ 
tions  about  the  necessity  for  such  a  program.  In 
this  recessionary  period,  some  companies  are  not 
willing  to  commit  such  substantial  Ending  to  joint 
research  programs. 

Parallel  computing  is  dominated  by  transputer- 
based  system  houses  such  as  Meiko  (U.K.),  Parsys 
(U.K.),  Parsytec  [Federal  Republic  of  Germany 
(FRG)],  and  Telmat  (France).  These  companies 
have  sold  more  than  500  systems  called  Super- 
Nodes,  which  range  in  size  from  16  to  400  trans- 
sputers.  A  popular  systems  topology  seems  to  be  a 
two-dimensional  mesh,  although  there  are  propos¬ 
als  to  build  a  10  x  10  x  10  three-dimensional  ma¬ 
chine.  Two  400-transputer  systems  are  installed  at 
Royal  Shell  in  The  Netherlands  and  the  University 
of  Edinburgh,  respectively.  The  Royal  Signals  and 
Radar  Establishment  (renamed  the  Defence  Re¬ 
search  Agency)  at  Malvern,  U.K.,  claims  perfor¬ 
mance  of  350  MFLOPS  with  a  256-transputer 
system. 

The  IBM  Victor  project  at  Yorktown  Heights 
(New  York)  is  based  on  European  transputer  tech¬ 
nology.  It  resulted  in  the  development  of  several 
multiprocessor  systems  with  16,  32,  64,  and  256- 
transputers.  A  number  of  applications  ranging 
from  modeling  of  oceanic  wind  currents  to  fault 
simulation  and  superconductivity  have  been  in¬ 
stalled.  The  goal  of  this  project  is  to  provide 
researchers  with  a  platform  for  experimentation  in 
message-passing  multiple  instruction,  multiple  data 
stream  (MIMD)  parallel  processors  with  distributed 
memory.  The  Victor  V256  is  a  16x  16  mesh  of 
transputers  that  delivers  almost  linear  speedup  for  a 
number  of  applications.  The  V256  performs  espe¬ 
cially  well  for  computation-intensive  Monte  Carlo 
codes. 

The  GENESIS  project  (initially  supported  by 
ESPRIT  II)  was  geared  toward  developing  "the 
world’s  fastest  computer,"  with  up  to  4096  of 
Intel's  860/870  processors  running  Fortran  90  at 
100  GFLOPS.  It  seemed  to  be  a  follow-up  to  a 
German  supercomputer  SUPRENUM  project.  The 
hardware  was  to  be  developed  by  KAE,  Stollman, 
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GMD,  and  the  Technical  Univers i(\  Berlin. 

The  software  would  involve  Chorus,  KAE, 
Suprenum,  and  the  GMD.  It  has  been  planned  to 
emphasize  a  wide  range  of  applications  (e.g., 
aerodynamics,  computational  fluid  dynamics,  struc¬ 
tural  mechanics,  partial  differential  equations, 
weather  forecasting)  that  are  to  be  developed  by  a 
large  number  of  European  companies.  With  the 
current  shift  in  funding,  this  project  might  be 
scaled  down  and  supported  by  the  Japanese,  within 
the  Real  World  Computing  Initiative. 

Several  efforts  are  also  aimed  at  building 
special-purpose  supercomputers.  The  most  notable 
include  the  Italian  APE  100  and  the  British  Associa¬ 
tive  String  Processor,  Distributed  Array  Processors 
(DAP),  and  Quantum-Chromo-Dynamics  (QCD-20) 
systems.  APEIOO  and  QCD-20  machines  are 
designed  to  solve  a  QCD  problem.  The  European 
effort  aimed  at  developing  Europe's  fastest  data¬ 
base  machine  is  also  worth  noting.  The  European 
Declarative  System  (EDS),  under  development  at 
the  European  Research  Center  in  Munich,  aims  at 
12,000  transactions  per  second. 

APEIOO  will  be  built  of  special  floating-point 
units  capable  of  delivering  50  MFLOPS  in  1.2-/tm 
technology.  The  1-GFLOPS  machine  was  ready  in 
1991,  and  a  128-unit  6-GFLOPS  machine  was 
scheduled  for  1992.  The  full-size  system  is 
planned  to  have  a  16  x  16  x  16,  three-dimensional 
array  of  4096  units  with  200  GFLOPS  performance 
in  1993. 

The  QCD-20  is  based  on  a  Meiko  i860  Vector 
Board  with  two  T800  transputers,  an  i860  co-pro¬ 
cessor  and  8  Mbytes  of  RAM.  Each  of  the  four 
modules  contains  64  i860  vector  nodes;  a  256-node 
system  should  have  an  expected  performance  of  17 
GFLOPS.  This  performance  will  provide  about 
three  times  the  speed  of  the  IBM’s  GP-11. 

The  Associative  String  Processor  at  Brunei 
University  (London,  U.K.)  is  an  associative  single- 
instruction  multiple-data  (SIMD)  computer  on  a 
wafer.  Active  Memory  Technology  Ltd.  (Reading, 
U.K.)  has  developed  a  range  of  parallel  computing 
machines  using  two-dimensionai  arrays  of  1-bit 
processors,  lliese  processors  are  based  on  DAP,  a 
predecessor  of  Connection  Machines  (CM-1, 


CM-2).  The  latest  DAP,  called  Starlight,  has  a 
goal  of  achieving  1.2  GFLOPS  from  a  small  cylin¬ 
drical  package,  which  would  be  5  inches  in  diame¬ 
ter  and  2.5  inches  high. 

In  addition  to  general  and  special-purpose 
processing,  Europe  has  a  number  of  fascinating 
projects  in  distributed  computing:  Amoeba  (The 
Netherlands)  is  a  state-of-the-art  operating  system 
with  the  capability  of  exploiting  a  multiprocessor 
assembly  for  high  performance;  Delta-4  (France), 
which  aims  at  high  dependability;  and  ANSA 
(U.K.)  and  REX  (U.K.)  have  distributed  comput¬ 
ing  environments  that  trade  flexibility, 
performance,  and  dependability. 

Europe  traditionally  has  been  strong  in  formal 
methods  and  computer  languages.  Formalisms  by 
Hoare  (Communicating  Sequential  Processes)  and 
by  Milner  (Calculus  of  Communication  Systems) 
are  well  established  and  ingrained  in  the  world’s 
computing  theory  community,  while  languages 
such  as  Pascal,  Prolog,  Ada,  Modula-2,  and  Occam 
are  used  by  many.  Europe’s  operating  systems 
such  as  Chorus  and  Amoeba  are  formidable  com¬ 
petitors  of  similar  efforts  in  the  United  States. 

In  the  fast  changing  world  of  high-performance 
conqjuting,  the  United  States  continues  to  lead,  but 
with  growing  impact  and  participation  of  Europe 
and  Japan,  Globalization  of  science  accelerates  the 
proliferation  of  good  and  bad  ideas.  Those  who 
will  be  able  to  use  good  ideas  and  turn  them  into 
competitive  products  will  dominate— not  only  in 
computer  science  but  in  other  areas  that  are  heavily 
dependent  on  computing. 

This  report  describes  the  status  of  European 
high-performance  computing  up  to  March  1992. 
The  new  ESPRIT  III  program  was  launched  in 
April  1992. 

For  further  information,  contact 

ESPRIT  Information  Desk 

Co?nmission  of  the  European  Communities 

BREY  10/178 

200  Rue  de  la  Loi 

B-1049  Brussels 

Belgium 
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1.  INTRODUCTION 

Europe,  Pacific  Rim  countries,  and  the  United 
States  continue  to  play  a  major  role  in  the  specifi¬ 
cation,  design,  and  manufacture  of  computer  sys¬ 
tems.  The  goal  of  this  report  is  to  shed  some  light 
on  developments  in  high-performance  computing 
for  all  who  are  interested  in  parallel  and  distributed 
computing  in  Europe.  (In  this  report  I  use  the 
term  "high-performance  computing"  in  a  broader 
sense— to  encompass  not  only  parallel  but  also 
distributed  confuting  systems.)  I  hope  it  will  help 
scientists  world-wide  to  better  understand  each 
others’  efforts  and  get  a  perspective  on  European 
research  as  the  trend  toward  globalization  contin¬ 
ues. 

The  report  is  divided  into  five  sections.  The 
Executive  Summary  gives  a  brief  ovciview  of 
major  projects.  The  Introduction  focuses  on  basic 
concepts  and  definitions  (many  readers  can  safely 
skip  this  section).  However,  readers  who  are  not 
familiar  with  Valiant’s  model  of  parallel  computa¬ 
tions  and  those  who  want  to  learn  more  about  the 
emerging  model  of  distributed  virtual  (and  other) 
memories  should  read  the  latter  part  of  this  sec¬ 
tion)  Section  2  provides  basic  definitions,  con¬ 
cepts,  and  taxonomies.  Section  3  is  concenied 
with  high-peiformance  computer  architectures. 

After  a  brief  introduction  I  describe  transputer 
developments,  which  are  the  backbone  of  many 
European  projects.  Under  general-purpose  parallel 
computing  I  cover  Meiko,  Parsys,  Parsytec,  and 
other  manufacturers  of  parallel  trunsnuter-based 
systems  as  well  as  GP-MIMD  [general-purpose 
h^MD  (multiple-instruction,  multiple-data  stream)] 
and  GENEvSIS  systems.  Special-purpose  parallel 
computers  are  then  described;  these  include  the 
APEKX),  Associative  String  Processor,  Distributed 
Array  Processor;  a  parallel  database  machine  called 
European  Declarative  System,  and  special-purpose 
transputer  systems.  The  last  part  of  Section  3 
concentrates  on  distributed  conq)uting  systems, 
such  as  Amoeba  and  Delta-4,  and  also  describes 
flexible  evolutionary  software  environments  such 
as  ANSA  and  REX  for  distiibuted  computing. 

The  focus  of  Section  4  is  software.  Starting 
with  an  overview,  pavallcl  languages  (both  impera¬ 
tive  and  declarative)  developed  in  Europe  are 
introduced.  Then  oporating  systems.  Chorus  at^l 
PEACE,  are  discussed,  followed  by  a  description 


of  European  Software  and  Systems  Initiative 
(ESSI).  Finally,  in  Conclusions  the  status  of  these 
systems  are  assessed,  including  speculations  on  the 
future  of  computing. 

My  background  is  in  parallel  computer  archi¬ 
tectures,  so  this  report  is  biased  toward  this  topic. 
A  number  of  areas,  such  as  formal  models,  appli¬ 
cations,  and  innovative  technologies  (e.g.,  optical 
and  molecular  computing),  have  not  been  covered 
in  this  report.  This  report  is  based  on  the  author’s 
personal  experience  during  his  visit  to  Europe  and 
presents  only  a  fraction  of  the  many  faces  of  high- 
performance  computing  in  Europe. 

Europe’s  diversity  and  its  traditions  in  innova¬ 
tion  and  engineering  make  it  a  fertile  ground  for 
many  ideas  to  grow  from  concept  to  implementa¬ 
tion.  There  is  no  doubt  that  Europe  plays  and  will 
continue  to  play  an  important  role  in  high-perfor¬ 
mance  computing,  especially  in  the  areas  of  formal 
methods,  computer  languages,  and  special-purpose 
and  distributed  computing. 

2.  BASIC  DEFIMTION,  CONCEPTS, 

AND  TAXONOMIES 

DEFINITION 

Although  controversy  continues  about  distin¬ 
guishing  attributes  between  parallel  and  distributed 
systems,  the  objectives  here  are  to  explain  major 
differences  and  similarities  between  these  two  types 
of  systems  and  to  define  other  forms  of  parallelism 
and  distribution. 

A  multicomputer  (multiprocessor)  is  a  comput¬ 
er  system  that  comprises  more  than  one  indepen¬ 
dent  processing  unit.  Multicomputer  systems  are 
generally  divided  into  two  general  categories: 
parallel  and  distributed. 

Parallel  systems  use  fast,  usually  parallel, 
networks;  distributed  systems  use  serial,  local 
area,  or  long-haul  networks.  Traditionally,  parallel 
systems  are  built  as  a  single  assembly  that  t}i>ically 
uses  a  synchronous  clock;  distributed  systems  are 
physically  distributed  within  an  establishment,  and 
each  con^uter  uses  a  separate  clock.  The  distinc¬ 
tions  are  fuzzy,  and  the  term  "multicomputer  sys¬ 
tem"  serves  as  an  umbrella  term  for  both  parallel 
and  distributed  computer  systems.  Comparison  of 
the  main  attributes  of  parallel  and  distributed  sys¬ 
tems  indicates  that,  except  for  physical  distribution. 
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most  features  of  distributed  systems  can  be  found 
in  parallel  systems. 

The  main  attributes  of  parallel  systems  are: 

•  multiplicity  of  resources, 

•  physical  proximity  of  resources, 

•  a  high-level  operating  system  that  unifies 
and  integrates  the  control  of  resources,  and 

a  system  transparency  whereby  the  resource 
does  not  have  to  be  identified  with  the  user. 

The  main  attributes  of  distributed  systems  are: 

•  multiplicity  of  resources, 

•  physical  distribution  of  resources, 

•  asynchronous  clocking, 

•  message-based  communication  with 
nonshared  memories, 

•  unifying,  high-level  operating  system,  and 

•  transparency. 

Frequently,  parallel  systems  arc  described  as 
being  tightly  coupled;  distributed  systems  are 
described  as  being  loosely  coupled. 


FORMS  OF  PARALLELISM 
AND  T  AXONOMIES 

Classification  of  Architectures 

There  are  numerous  fonns  of  parallelism. 

They  range  from  bit-level  concurrent  processing 
and  simultaneous  execution  of  various  functions 
(such  as  input/output,  instruction  execution,  and 
communication)  to  separate  concurrent  execution  of 
multiple  programs  on  separate  computers.  The 
levels  of  parallelism  can  be  classified  as  follows. 

•  Bit  level— all  contemporary  processors  work 
on  multiple  bits  (usually  32,  currently 
changing  to  64)  at  a  time,  and  the  commu¬ 
nication  between  processor,  memory,  and 
input/output  (I/O)  is  parallel  in  almost  every 
case.  A  number  of  multiprocessor  systems 
also  use  single-bit  processors  (ASP,  DAP, 
CM-1,  CM-2). 

•  Instruction  level— two  types  of  parallelism 
may  occur  at  this  level.  First  is  an  instruc¬ 


tion  pipeline  where,  for  example,  fetch  and 
operation  may  overlap  in  time.  This  means 
that  instruction  II  is  fetched  in  Step  1,  then 
in  Step  2;  while  11  is  executed,  12  lo  being 
fetched.  When  12  is  executed,  13  is  leing 
fetched.  This  process  is  repeated  until  all 
instructions  are  executed.  If  there  is  an 
unexpected  change  in  control  flow,  the 
instruction  that  was  fetched  is  ignored  and 
an  appropriate  branching  occurs.  The  other 
form  of  parallelism  may  occur  by  concur¬ 
rent  execution  of  more  than  one  instruction, 
or  of  the  sam.e  instruction  with  different 
data  on  multiple  arithmetic-logic  units. 

•  Task  level— multiple  tasks  are  executed  on 
different  processors  and  are  coordinated  to 
complete  a  particular  job. 

^  Job  level— multiple  jobs  (programs)  are 
independently  executed  on  different  proces¬ 
sors.  For  example,  popular  parallel  com¬ 
puters  such  as  Sequent  may  operate  in  this 
mode  where  each  processor,  upon  comple¬ 
tion  of  a  particular  job,  gets  a  new  job  to 
execute  from  the  centralized  job  queue. 

Flynn’s  Taxonomy 

Flynn  introduced  a  taxonomy  of  computer 
architecture  based  on  instruction  data  streaii 

•  Single-instruction,  single-data  stream  (SISD) 
represents  mainly  single  processing  units, 
although  they  may  be  pipelined. 

•  Single-instruction,  multiple-data  stream 
(SIMD)  corresponds  to  processor  arrays 
that  execute  the  same  instruction  on  differ¬ 
ent  data.  For  example,  a  checking  account 
transaction  could  be  processed  in  this  mode. 
The  program  for  each  transaction  is  the 
same,  only  the  data  are  different.  This 
mode  would  allow  multiple  processors  to 
simultaneously  execute  the  same  program 
with  different  data. 

•  Multiple-instruction,  single-data  stream 
(MISD)  occurs  when  multiple  processors 
operate  on  the  same  data  For  example,  the 
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same  description  of  an  image  (data)  could 
be  processed  on  different  computers,  there¬ 
by  producing  different  perspectives  of  the 
same  image.  This  form  of  parallelism  can 
also  be  found  in  microprograms  where 
different  operations  can  be  executed  on  the 
same  data. 

•  Multiple-instruction,  multiple-data  stream 
(MIMD)  reflects  processing  where  dil^erent 
tasks  or  jobs  are  executed  on  different  data 
on  multiple  conqputers. 

Because  several  multiprocessor  systems  sup¬ 
port  SIMD  or  MIMD  modes  of  operation,  an 
additional  term  was  coined  (SMIMD),  which  re¬ 
flects  the  ability  of  some  multiprocessors  to  sup¬ 
port  both  modes  of  operation.  Also,  MSIMD 
represents  multiple  SIMD  streams  that  may  occur 
in  some  processor  arrays. 

HMndler’s  Taxonomy 

Hftndler  has  proposed  a  notation  that  describes 
parallelism  and  pipelhdng  at  three  levels  of  a  com¬ 
puter  system:  tlie  processor  control  unit  (PCD), 
the  arithmetic  logic  unit  (ALU),  and  the  bit-level 
circuit  (BLC). 

A  computer  is  characterized  by  three  pairs  of 
integers: 

T{c)  »  (A’x/f'.  DXD',  WxW'), 

where 

T(c)  is  type  of  computer  c, 

K  is  the  number  of  PCUs, 

K*  is  the  number  of  PCUs  that  can  be 
pipelined, 

D  is  the  number  of  ALUs  controlled 
by  each  PCU, 

D'  is  the  number  of  ALUs  that  can  be 
pipelined, 

W  is  the  number  of  bits  in  an  ALU  or 
processing  element  (PE)  word,  and 

W  is  the  number  of  pipeline  segments  in  all 
ALUs  or  in  a  single  PE. 

If  the  value  of  second  element  in  any  pair  is  equal 
to  1,  then  it  is  omitted. 


A  single-CPU  Cray-1  with  16  functional  units, 
8  of  which  can  be  pipelined  and  each  functional 
unit  with  1  to  14  6^bit  wide  segments,  can  be 
described  as 

7)[Cray-l)  =  [1.  16x8,  64x(l  ~  14)]. 
Methods  Used  to  Achieve  lUgfaer  Performance 
Valiant’s  Taxonomy 

A  parallel  random-access  machine  (PRAM) 
model  that  deals  mainly  with  access  rights  was 
proposed  by  Valiant.^  With  the  PRAM  model. 
Valiant  proved  universality  of  parallel  computa¬ 
tions,  which  is  similar  to  Ae  Turing  result  for  the 
uniprocessor  machine.  This  idealized  model  allows 
each  of  the  processors  to  write  and  read  into  other 
memories  in  one  machine  cycle,  ignoring  commu¬ 
nication  overheads.  Three  types  of  PRAM  models 
are  proposed,  based  on  exclusivity  of  concurrent 
reads  and  writes. 

1.  Seclusive  PRAM  (SPRAM)— concurrent 
reads  or  writes  to  a  given  memory  location 
by  more  than  one  processor  are  not  al¬ 
lowed: 

2.  Exclusive  PRAM  (EPRAM)— more  than 
one  processor  may  access  the  same  memo¬ 
ry  module,  but  only  one  processor  can 
access  a  single  memory  location  at  a  time; 
and 

3.  Concurrent  PRAM  (CPRAM)— more  than 
one  processor  is  allowed  to  access  the  same 
store  location  simultaneously. 

Valiant’s  models  are  conceptually  important  and 
push  the  previously  discussed  taxonomy  even 
further.  A  grand  challenge  will  be  to  validate  them 
in  practice.  The  significance  of  combining  (an 
access  to  a  single  memory  location  by  multiple 
processors  simultaneously)  for  con^utationally 
intensive  problems  is  still  an  open  question. 

Memory  Model-based  Taxonomy 

Distributed  Memo  r  Model 

This  model,  also  known  as  a  message-passing 
or  loosely-coupled  system,  is  widely  accepted  by 
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computer  science  theoreticians,  who  like  its  logical 
elegance  and  extensibility.  Languages  such  as 
OCCAM  can  handle  communication  and  partitioning. 
Input/output  and  workload  distribution  pose  several 
problems,  and  no  widely  accepted  language  for 
applications  is  available.  One  of  the  major  diffi¬ 
culties  from  a  user’s  point  of  view  is  lack  of  conti¬ 
nuity  (portability)  and  the  necessity  for  developing 
applications  from  scratch.  (This  issue  of  continu¬ 
ity,  the  ability  for  the  user  to  move  smoothly  from 
current  to  future  programming  practice,  is  critical. 
There  is  no  compiler  that  can  efficiently  handle  a 
distributed-memory  model.)  Nevertheless,  there  is 
strong  support  for  this  model,  based  on  the  belief 
that  scalability  and  elegance  will  pay  off.  There¬ 
fore,  several  major  European  Strategic 
Programmes  for  Research  and  Development  in 
Information  Technologies  (ESP'-UT)  programs  are 
under  way. 

Shared  Memory  Model 

Shared  memory,  as  in  a  conventional 
uniprocessor,  provides  a  global  address  space  and 
allows  easy  programming  in  conventional  languag¬ 
es  such  as  Fortran,  Pascal,  C,  Lisp,  and  Prolog. 

In  short,  it  offers  "continuity. "  Simple  SIMD-like 
parallelization  (e.g.,  DO  loops)  can  easily  be 
achieved  by  using  state-of-the-art  compilers. 

Shared  Virtual  Memory  Model 

A  physically  distributed  memory  that  appears 
to  a  programmer  as  a  shared  memory  supporting  a 
global  address  space  is  known  as  shared  virtual 
memory  (SVM).^’^  The  SVM  implies  a  sharing  of 
data  stored  in  local  virtual  memories  with  paging 
capability.  Implementation  is  one  of  the  hottest 
topics  in  computer  architecture  research  today. 

The  SVM  promises  continuity  for  users  by  provid¬ 
ing  a  global  address  space  and  ease  of  implementa¬ 
tion.  Ihis  model  can  also  be  viewed  as  shared 
memory  with  caches  in  secondary  storage,  in  con¬ 
trast  to  a  pure  shared-memory  model  that  is  consid¬ 
ered  to  be  without  caches  or  secondary  storage. 

Hybrid  Memory  Model 

This  model  combines  both  shared  and  distrib¬ 
uted  memory.  An  example  of  its  implementation  is 
on  the  IBM’s  Research  Parallel  Processor  Proto¬ 


type  (RP3)^  where  simple  "fence  registers"  delin- 
eau  boundaries  between  local  and  global  space  for 
each  processor.  With  order  of  tens  of  nanoseconds 
delay  over  a  64-way  multistage  network,  a  shared 
memory  carries  a  relatively  Small  penalty  for  the 
convenience  of  global  access. 

Some  researchers  believe  that  this  approach  is 
not  scalable  although  others  believe  the  opposite  is 
true.®*^  State-of-the-art  techniques  and  technology 
may  support  several  thousand  processor  systems. 
With  the  continued  fast-pace  progress  in  technolo¬ 
gy  and  some  advances  in  architecture,  further 
extensibility  is  assured.  Optical,  free-space  inter¬ 
connection  networks  may  provide  a  quantum  leap 
in  this  area. 

There  are  four  fundamental  methods  of  im¬ 
proving  computer  performance. 

1 .  Computer  technology.  Traditionally,  com¬ 
puter  technology  has  been  the  major  force 
in  improving  performance  of  classical  ma¬ 
chines  that  are  based  on  von  Neumann’s 
architecture.  Computer  speeds  have  main¬ 
tained  an  order  of  magnitude  or  more  im¬ 
provement  every  five  years  and  the  pace 
increases. 

2.  Computer  organization  of  a  single  comput¬ 
er.  The  improvements  to  software  and 
hardware  organization,  overlapping  in¬ 
put/output  and  processing  operations,  mem¬ 
ory  interleaving,  multithreading,  instruction 
pipelining,  and  memory  management  have 
an  impact  on  improving  the  overall  comput¬ 
er  performance. 

3.  Algorithms.  Improvements  in  algorithms 
has  had  a  profound  impact  on  computer 
performance.  Several  approaches  to  many 
difficult  problems  have  been  consistently 
improved,  resulting  in  enhanced  perfor¬ 
mance  of  some  specific  applications  from 
ten  to  several  thousand  times. 

4.  Multicomputing.  By  connecting  multiple 
processors,  system  performance  can  be 
improved.  Several  systems  offer  perfor¬ 
mance  improvement  in  orders  of  magnitude 
over  a  single  microprocessor. 
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3.  HIGH-PERFORMANCE  COMPUTER 
ARCHITECTURES 

INTRODUCTION 

This  report  focuses  is  on  high-performance 
computer  architectures.  High-performance  com¬ 
puting  here  is  perceived  more  broadly  and  includes 
both  parallel  and  distributed  computing.  This 
section  discusses  and  assesses  major  European 
research  and  commercial  efforts.  A  vast  body  of 
literature  exists  that  describes  in  detail  most  of  the 
cited  systems.  This  report  focuses  on  salien  and 
unique  system  features  -avoiding  mundane  descrip¬ 
tions  of  widely  known  concepts. 

European  research  efforts  are  diverse  and 
numerous.  It  is  difficult  if  not  impossible  to  tell 
the  whole  story  and  justly  evaluate  all  ongotn;; 
efforts.  The  projects  described  appear  to  be  the 
most  viable  ones,  but  the  list  is  by  no  means  ex¬ 
haustive.  It  reflects  the  author’s  personal  experi¬ 
ences  based  on  participation  in  European  confer¬ 
ences  and  workshops,  discussions  with  colleagues, 
and  contacts  with  various  European  research 
groups. 

Descriptions  of  various  high  performance 
systems  have  been  divided  into  three  classes; 


es  that  are  occurring  in  computing.  The  idea 
originated  in  Bristol,  U.K.,  at  a  small  company 
called  Inmos.  Imnos,  after  being  bought  by  the 
giant  record  manufacturer  EMI,  finally  was  pur¬ 
chased  by  the  Franco-Italian  concera  SGS- 
Thomson,  which  is  committed  to  making  Inmos  the 
miijor  supplier  of  components  for  European  high- 
performance  computing. 

The  previous  phase  of  the  ESPRIT  program 
resulted  in  the  capability  to  develop  a  1 ,024  pro¬ 
cessor  system  known  as  a  "supemode."  At  present 
several  companies  (primarily  Meiko,  Parsys, 
Parsytec,  and  lelmat)  are  able  to  deliver  a  variety 
of  multiprocessor  systems.  A  new  parallel  pro¬ 
cessing  initiative,  v/itb  a  budget  of  $120  million, 
began  with  the  deployment  of  55  systems  support¬ 
ing  more  than  200  researchers.  It  is  expected  that 
about  300  students  will  graduate  each  year  to 
support  the  growing  multiprocessor  industry. 
ESPRIT  already  has  a  number  of  programs  in 
place;  examples  include: 

•  PUMA  (architectures), 

•  NONA  (algorithm  development), 

•  COIdPARE  (compiler  technologies),  and 

•  REX  (development  and  management  of 
parallel  and  distributed  systems. 


•  general-purpose  parallel  systems, 

•  special-purpose  parallel  systems,  and 

•  distributed  computing  systems, 

but  these  divisions  are  more  customary  than  real. 

In  my  opinion,  there  is  a  continuum  across  these 
systems,  and  it  is  difficult— if  not  impossible— to 
tell  when  a  particular  general-purpose  system 
becomes  special-purpose  and  vice  versa.  My 
decision  for  classifying  a  particular  system  is  based 
on  classical  taxonomy,  which  distinguishes  between 
SIMD  and  MIMD  systems;  scalar,  vector,  and 
parallel  processing;  as  well  as  between  parallel  and 
distributed  computing. 

TRANSPUTERS  AND  OTHER  COMPONENTS 

A  number  of  European  efforts  in  high-perfor¬ 
mance  computing  are  centered  around  a  transputer 
(a  microprocessor  that  is  specifically  designed  for 
parallel  con^uter  systems).  The  turbulent  history 
of  transputer  development  reflects  the  rapid  chang¬ 


In  the  post-RISC  (Reduced  Instruction  Set 
Computer)  Supemode-2  (GP-MIMD)  microproces¬ 
sor  initiative,  Meiko.  Parsys,  Telmat,  and  Inmos 
will  be  involved  in  a  $120  million  development 
effort  to  create  the  next  generation  of  super¬ 
computers. 

A  new  transputer,  called  the  T9000  (previously 
the  HI),  will  run  at  rates  of  200  MIPS  peak  and  70 
MIPS  sustained,  in  addition  to  performance  of 
more  than  25  MFLOPS  peak  and  IS  MFLOPS 
sustained.  New  bidirectional  conununication  links 
will  be  able  to  deliver  80  Mbytes/s  by  multiplexing 
logical  channels  on  physical  links.  The  T9()00  will 
support  conventional  operating  system  features 
such  as  multiple  priority  levels,  memory  protec¬ 
tion,  and  relociuion.  An  enhanced  process  model 
will  support  real-time  kernels  and  operating  sys¬ 
tems.  A  major  change  is  in  the  support  for 
interprocessor  communication.  New  protocols  will 
ensure  high  rates  and  low  latency  by  using 
multiplexing  and  automatic  routing.  Binary  com¬ 
patibility  with  the  present  transputer  family  will  be 
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maintained,  and  Inmos  will  support  mixed  Tx/Hx 
series  (Tx  series  indicates  the  transputer  family  of 
products  such  as  T400,  T800,  T9000,  and  others). 
Improved  performance  for  multiprocessor  assem¬ 
blies  will  come  together  with  ease  of  use.  The 
new  chip  will  include  a  pipelined  superscalar  pro¬ 
cessor,  floating-point  unit,  16K  cache,  communica¬ 
tion  processor,  and  programmable  memory  inter¬ 
face  manufactured  in  advanced  CMOS  technology. 
Up  to  16  Mbytes  of  RAM  can  be  attached  to  each 
processor  without  any  additional  components,  and 
up  to  4  Gbytes  can  be  added  in  total. 

In  the  future,  Inmos  will  continue  with  the 
T-series,  but  the  emphasis  will  be  on  applica¬ 
tion-specific  transputers  (the  ASIC  T  series; 

ASIC— app' cation-specific  integrated  circuit).  In 
the  high-performance  end  of  the  T  series  will  be 
mainline  products  such  as  T9(XX),  while  at  the 
same  time,  a  number  of  ASICs  will  be  designed 
with  specific  functions  (as  requested  by  system 
houses  and  large  users).  The  migration  of  software 
into  hardware  will  continue.  The  post-T  genera¬ 
tion  of  transputers  (to  be  introduced  in  1993),  the 
E-series,  is  geared  to  meet  the  requirements  of  the 
European  Microinitiative  (EMI).  EMI's  aim  is  to 
define  and  support  a  wide  range  of  open  systems. 
Inmos  already  claims  the  lowest  cost  per  MIP  (less 
than  $2).  With  its  new  infusion  of  capital  because 
of  its  purchase  by  SGS-Thomsom  in  1989,  it  may 
become  one  of  the  main  forces  in  the  microproces¬ 
sor  industry  in  the  world. 

Another  interesting  development  is  the  Inmos 
commitment  to  the  design  and  in^lementation  of  a 
variety  of  communication  ASIC  chips.  The  ClOx- 
series  will  be  capable  of  connecting  transputer 
products  to  any  bus.  The  C1(X)  chip,  a  converter, 
will  support  communication  between  the  high  and 
low  end  of  the  Tx  series  of  transputers,  which  will 
have  a  new  generation  of  links.  The  0004,  32- 
way  crossbar  switch,  will  be  rq>iaced  by  the  new 
C104  chip,  with  improved  protocols  and  the  latest 
tedinology.  Links  for  a  number  of  products  will 
also  be  develq)ed. 

The  links  for  the  T9000  processor  will  enable 
conununication  between  proc»s«i  on  Uie  same  chip 
via  monory,  as  well  as  support  conununication 
between  processes  on  different  chips.  The  bidirec¬ 
tional,  self-synchronizing,  virtual  conununication 


channels  use  send-back  protocols  with  an  up-to- 
32-byte  packet  format  having  an  end-of-message 
(EOM)  byte.  Splitting  messages  into  packets  is 
automatic,  and  wormhole  routing  is  used.  The 
communication  is  deadlock  free;  addressing  is 
provided  by  a  special  interval  numbering  scheme. 
The  new  routing  chips  will  provide  communication 
capability  for  many  heterogeneous  devices  and 
networks,  such  as  Ethernet  and  token-passing 
rings. 

Companies  manufacturing  the  transputer-based 
systems  are  growing  in  both  size  and  number.  For 
example,  Meiko,  Parsys,  Parsytec,  and  Telmat 
have  developed  powerful  "supemodes,"  which 
contain  up  to  16  transputers  on  a  single  board  (see 
the  section  on  General-Purpose  Parallel  Computers 
of  this  report).  According  to  manufacturers, 
"meganodes"  may  have  as  many  as  1,024  trans¬ 
puters.  To  my  knowledge,  however,  the  largest 
existing  systems  do  not  have  more  than  500  pro¬ 
cessors. 

The  U.S.  research  community  is  well  acquaint¬ 
ed  with  transputers.  A  number  of  meetings  on 
transputers  are  held  each  year,  and  several  transpu¬ 
ter-based  systems  were  developed  at  IBM  Research 
in  Yorktown  Heights.  During  1988-1990  several 
multiprocessor  systems  with  16,  32,  64,  and 
256-transputer  systems  were  implemented  with  a 
number  of  applications.  These  applications  range 
from  modeling  oceanic  wind  currents  to  fault 
simulation  and  superconductivity.  The  goal  of  this 
project  was  to  provide  researchers  with  a  platform 
for  experiments  in  message-passing  MIMD  parallel 
processors  with  distributed  memory.  The  Victor 
v2S6  is  a  16x16  mesh  of  transpuleis  that  delivers 
almost  linear  speedup  for  a  number  of  applications. 
V2S6  performs  especially  well  for  computationally 
intensive  Monte  Carlo  c^es. 

Inmos  nukde  history  with  their  delivery  of 
transputers,  the  first-ever  parallel  processor- 
oriented  systems,  in  the  early  19808.  With  its 
latest  T9000  and  family  of  communication  compo- 
rants,  Innws  cominues  to  create  interesting  prod¬ 
ucts.  Despite  delayed  T9000  and  competitive 
annouiKements  from  Intel,  Motorola,  Apple-Moto- 
rola-lBM,  Hewlett-Packard  (HP),  and  Digital. 

Inmos  may  still  maintain  its  leading  role  in  Europe 
because  it  is  the  only  European  manufacturer  of 
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high-performance  microprocessors.  It  continues  to 
produce  unique,  still-highly-competitive  products, 
especially  with  respect  to  price/performance. 

To  obtain  a  bibliography  containing  more  t*'an 
1,000  entries  on  transputers  and  Occam  parallel 
language,  contact; 

Michael  Poole 

Inmos  Ltd. 

1000  Aztec  West 

Almondsbury,  Bristol  BS12  4SQ 

United  Kingdom 

E-mail;  ong@uk.co.inmos 

Phone;  +44-454-616616 

GENERAL-PURPOSE  PARALLEL 
COMPUTERS 

Meiko,  Parsys,  Parsytec,  and  Others 
Synopsis 

A  number  of  European  start-up  computer 
companies  took  up  the  challenge  of  designing  and 
manufacturing  parallel  computers.  Their  efforts 
initially  focused  on  transputer-based  systems,  but 
over  die  years  some  of  them  branched  off  to  other 
microprocessors  (such  as  Intel’s  i860).  The  efforts 
of  these  companies  resulted  in  the  development  of 
ESPRIT-supported  Supemode  systems,  which 
could  be  assembled  with  up  to  IK  T800 
transputers. 

Oirrently,  three  of  the  companies  [Meiko 
Scientific  Ltd.  (U.K.),  Parsys  Ltd.  (U.K.),  and 
Telmat  Informatique  (France)]  are  involved  in  a 
GP-MIMD  consortium  whose  goals  are  to  develop 
scalable,  parallel  systems  with  portable  software 
(described  later).  Parsytec  [Federal  Republic  of 
Gemuuiy  (FRG))  plans  to  combine  64K  T9000 
transputers  to  develop  a  TeraFLOPS  supercompu¬ 
ter. 

These  conqianies  continue  to  develop  their 
own  systems;  salient  features  of  their  respective 
approaches  are  described. 

Meiko 

Meiko  is  a  spinoff  from  Inmos;  it  was  founded 
in  1985,  also  in  Bristol  (a  few  blocks  from  Inmos). 
It  initially  was  geared  toward  the  design  and  devel¬ 


opment  of  transputer-based  systems.  The  company 
boasts  the  largest  number  of  installed  parallel 
systems  worldwide  (more  than  350).  It  participat¬ 
ed  in  ESPRIT’S  Supemode  project  and  is  presently 
cooperating  in  GP-MIMD.  The  current  approach 
is  cidled  Computing  Surface;  its  goal  is  to  provide 
a  software-level  abstraction  above  a  heterogeneous 
hardware  environment. 

Meiko’s  hardware  surfaces  today  comprise 
transputers  as  well  as  Intel  and  Sparc  microproces¬ 
sors.  Meiko  offers  a  wide  range  of  products— 
from  supemodes  with  up  to  IK  processors  to  OR¬ 
ACLE  parallel  servers  (in  partnership  with  Oracle, 
a  U.S.  company).  It  has  offices  in  both  England 
and  the  United  States.  Meiko’s  ORACLE  scalable 
parallel  server  provides  throughput  from  30  to 
1,000  transactions  per  second.  The  parallel  rela¬ 
tional  database  server  can  be  used  in  stand-alone, 
networked,  and  VAXaccelerator  conftgurations. 
Unique  features  include  shared  buffer  caches 
(which  allow  database  access  without  disabling 
"fast  commit/deferred  write"  or  "group  commit"), 
transparency  with  VAX  systems,  and  real  time  (the 
response  times  can  be  guaranteed  for  on-line  trans¬ 
action  processing).  Three  major  components  (pro¬ 
cessing  node,  special  I/O  node,  and  disk  node) 
allow  flexible  system  configuration  to  efficiently 
meet  diversified  users  requirements. 

Parsys 

Parsys  Ltd.  is  a  British-based  parallel  UNIX 
systems  manufacturer  that  was  formed  in  1988  as  a 
spinoff  from  ESPRIT’s  supernode  initiative. 

A  supernode  architecture  developed  by  Parsys 
has  a  modular  structure.  This  allows  machines  of 
arbitrary  size  to  be  assembled,  to  a  maximum  of 
1024  transputers.  The  basic  module  consists  of  16 
worker  transputers  with  memory;  a  controller 
transputer  with  read-only  memory  (ROM)  and 
random-access  memory  (RAM)  (it  has  memory 
mapped  into  each  transputer);  two  ports  for  I/O 
controllers  (e.g.,  disk  controllers);  and  a  pair  of 
crossbar  switches.  These  switches  can  provide 
direct  iiqiut/output  links  for  up  to  72  transputers. 
Larger  systems  of  up  to  1024  worker  transputers 
use  a  second  "outer"  level  of  crossbar  switching 
elements  and  connect  up  to  64  16-transputer  sub¬ 
systems.  This  architecture  provides  an  excellent 
connectivity  that  requires  traversal  of  a  maximum 
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of  three  switching  elements  along  any  communica* 
tion  path.  High  system  configurability  allows  a 
great  deal  of  flexibility  in  task  allocation;  A  full 
system  configuration  can  attain  up  to  1.6  GFLOPS. 

The  SuperNode  1000  Series  is  provided  with 
IDRIS,  which  is  a  POSIX-  and  UNIX-compatible 
operating  environment.  The  IDRIS  consists  of  a 
multiuser,  multitasking  executive  and  a  wide  range 
of  utilities.  Compilers  for  C,  Pascal,  and  Fortran 
are  available,  along  with  a  vector  and  numerically 
efficient  procedure  libraries  written  in  transputer 
assembler. 

Parsys,  like  Meiko,  has  a  partnership  with 
Oracle  Corporation  and  offers  a  full  implementa¬ 
tion  of  SQL  (standaid  query  language),  the  indus¬ 
try-standard  relational  database  language. 

In  cooperation  with  PEGS  Ltd.  [the  acknowl¬ 
edged  leader  in  the  field  of  finite-element  analysis 
(FEA)  software],  Parsys  has  developed  an  FEA 
machine  that  is  competitive  with  conventional 
supercomputer  systems  such  as  Cray. 

Several  performance  enhancers  such  as  frame 
grabbers  and  fast  I/O  .^rds  are  also  offered  to 
improve  overall  performance,  especially  in  applica¬ 
tions  such  as  simulation,  signal  processing,  and 
visualization. 

Parsytec 

Founded  in  the  FRG  in  1985,  Parsytec  was 
among  the  pioneering  companies  that  turned  trans¬ 
puter  concepts  into  working  multiprocessor  sys¬ 
tems.  Main  products  of  the  company  are  the 
Multicluster  and  SuperCluster  series; 

•  Multicluster- 1  series  host  up  to  30  proces¬ 
sors  in  a  single  rack; 

•  MultiCluster-2  series  have  from  16  to  64 
processors  in  one  rack;  and 

•  SuperCluster  series  conqirise  from  64  to 
1024  transputers  in  a  single  frame. 

Single  interfaces  allow  these  machines  to  connect 
to  FC,  SUN,  and  VAX  systems. 

MuUiCluster-1  series  are  statically  configurable 
systems  and  can  be  tailored  to  specific  user  re¬ 
quirements  such  as  number  of  processors,  amount 
of  memory,  and  I/O  configuration,  as  well  as 


system  topology.  The  required  processor  topology 
can  be  configured  by  using  UniLink  connections 
fed  through  the  special  back  plane.  In  addition, 
four  external  sockets  are  provided. 

Multinuster-2  uses  network  configuration 
units  (NCUs)  that  provide  flexible,  dynamically 
configurable  interconnection  networks.  The  multi¬ 
user  environment  can  support  up  to  eight  users  by 
using  Parsytec’s  multiple  virtual  architecture  soft¬ 
ware.  The  NCU  design  is  based  on  the  Inmos 
crossbar  switch,  the  C004,  which  gives  full  cross¬ 
bar  connectivity  for  up  to  16  transputers.  Each 
NCU,  made  of  C004S,  connects  up  to  96  UniLinks 
that  link  internal  as  well  as  externa)  transputers  and 
other  I/O  subsystems.  MultiCluster-2  provides  the 
ability  to  configure  a  variety  of  fixed  interconnec¬ 
tion  topologies  such  as  tree  or  mesh  structures. 

SuperCluster  has  a  hierarchical,  cluster-based 
design.  A  basic  unit  is  a  16-transputer,  fiilly  con¬ 
nected  cluster;  larger  systems  have  additional  levels 
of  NCUs  to  form  necessary  connections.  The 
Network  Configuration  Manager  (NCM)  software 
controls  the  NCUs  and  dynamically  establishes  the 
required  connections.  Each  transputer  can  be 
equipped  with  1  to  32  Mbytes  of  dynamic  RAM 
with  single-error  correction  and  double-error  detec¬ 
tion. 

Parsytec’s  software  is  based  on  the  Helios 
operating  system  and  the  MultiTool  development 
environment,  which  gives  a  UNIX-like  operating 
environment.  In  1991  Parsytec  announc^  plans  to 
build  a  system  of  65,536  (64K)  T9000  transputers 
using  a  hypercube  interconnection  network;  this 
project  is  still  under  way.  This  system  could 
potentially  deliver  aggregated  performance  exceed¬ 
ing  one  trillion  floating-point  operations  per  sec¬ 
ond!  Parsytec  plans  to  have  the  system  operational 
by  1993,  but  this  delivery  date  seems  overly  opti- 
niistic.  With  delays  in  T9000  availability,  high 
initial  cost,  and  several  technical  problems  that 
must  be  solved,  this  ambitious  project  simply  may 
not  be  ready  by  1993. 

It  will  take  at  least  two  to  three  years  to  attain 
expertise  to  assemble  so  many  con^)onents.  With 
the  rapid  pace  in  microprocessor  technology,  it 
may  turn  out  that  in  three  years  we  will  ne^  only 
IK  to  2K  processors  to  break  the  TeraFLOPS 
barrier. 
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Yelmat 

Telmat  Informatique  is  a  French  company  that 
provides  parallel  transputer-based  systems  with 
performance  ranging  from  30  MFLOPS  to  2 
GFLOPS.  T.Node  systems  (with  8  to  64 
transputers)  and  MEGA  Nodes  (with  128- 1024 
transputers)  offer  a  software  environment  similar  to 
Parsytec,  with  a  Helios  operating  system, 
X“Windows,  and  support  for  C,  Fortran,  Pascal, 
OCCAM,  Ada,  and  additionally  for  Strand-88.  The 
company  is  the  biggest  seller  of  parallel  systems  in 
France. 

Transtech 

Transtech,  a  company  based  in  High  Wycom¬ 
be,  near  London,  specializes  in  MIMD  parallel 
systems,  with  emphasis  on  heterogeneous  technolo¬ 
gy  and  open  systems.  It  produces  a  range  of 
multiprocessor  and  Zoran  vector  processor  sys¬ 
tems.  The  systems  can  be  installed  as  accelerators 
inside  personal  computers  (PCs)  or  SUN  Work¬ 
stations,  attached  to  a  VME  bus,  or  they  can  oper¬ 
ate  as  stand-alone  systems. 

The  multiprocessor  systems  are  assembled  by 
using  a  staged  VME  bus  or  a  crossbar  connection 
chip  from  Inmos.  The  performance  ranges  from  1 
MFLOPS  to  2  GFLOPS.  Transtech  provides  a 
gamut  of  accelerator  boards— ranging  from  high- 
resolution  graphics  modules  to  fast  interfaces  and 
digital/analog  converters. 

The  software  includes  C,  Fortran,  (XCAM,  and 
Ada  compilers  as  well  as  comprehensive  mathemat¬ 
ics  and  graphics  libraries,  A  GENESYS  II  Operat¬ 
ing  System  is  used.  It  furnishes  support  for  sym¬ 
bolic  debuggers,  optimization  tools,  and  automatic 
configuration  of  processors. 

SUMMARY 

There  are  at  least  20  parallel  computer  compa¬ 
nies  in  Europe,  many  of  them  branching  off  to  the 
U.S.  and  positioning  themselves  as  American 
companies. 

While  initially  most  of  these  companies  were 
mainly  transputer  platform  manufacturers,  they  are 
currently  open  to  overseas  technologies.  The 
number  of  companies  staking  their  future  on  Intel’s 
i860,  MIPS,  Rx(X}0  series,  and  Spares  is  grow- 


iitg;  with  market  globalization  it  soon  will  be  diffi  ¬ 
cult  to  tell  whether  a  given  company  is  European 
or  American. 

With  the  proliferation  of  parallel  systems, 
Europe  is  quickly  gaining  experience  in  parallel 
system  design  and  software  development.  It  may 
turn  out  that  the  Parsytec’s  challenge  to  develop  a 
TeraFLOPS  computer  by  1993  is  not  quite  hollow. 

GENERAL-PURPOSE  MIMD  COMPUTER 
(GP-MIMD) 

Synopsis 

The  General-Purpose  Multiple-Instruction 
Multiple-Data  (GP-MIMD)  parallel  computer 
project  is  a  concened  effort  by  four  major  compa¬ 
nies  in  partnership  with  several  associated  contrac¬ 
tors  from  research  laboratories,  companies,  and 
universities.  Its  objective  is  to  create  a  scalable 
parallel  computer  with  portable  software.  The 
main  partners  are  Inmos  (T9000  processor  manu¬ 
facturer  from  Bristol.  U.K.)  and  three  manufactur¬ 
ers  of  transputer-based  systems— Meiko  (U.K.), 
Parsys  (U.K,),  and  Telmat  (France).  Associated 
contractors  include  Grupo  APD  (Spain);  the  Euro¬ 
pean  High-Energy  Research  Center  (CERN,  Gene¬ 
va);  Chorus  Systems  (France);  Institute  de 
Enghenaria  de  Sistemas  e  Computadores  (INESC, 
Lisbon);  Siemens  (FRG);  Swedish  Institute  of 
Computer  Science,  Southampton  University 
(U.K.);  and  Institut  de  Recherche  en  Informatique 
et  Systemes  Aleatoires  (IRIS A,  France). 

In  addition  to  scalability  and  portability,  major 
design  objectives  include: 

1.  Software 

•  Application  support  interface 

(ASI) 

•  Ease  of  parallel  programming 

•  Architectural  independence. 

2.  Hardware 

•  Standard  I/O  interfaces 

•  Cost-effectiveness 

•  Fault  tolerance 

•  Simplicity 

•  Ease  of  manufacturing. 
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This  impressive  list  of  goals  boils  down  to  one 
issue;  Bring  parallel  computing  to  the  mass  market 
where  a  personal  computer  user  can  comfortably 
work  on  parallel  machines,  while  software  houses 
can  invest  in  a  new  generation  of  software.  Hope¬ 
fully,  software  houses  can  build  on  a  common 
software  layer.  The  partner  companies  have  estab¬ 
lished  a  common  transputer-based  platform  for 
supemodes.  Their  joint  work  toward  standardiza¬ 
tion  may  have  a  lasting  impact  on  parallel  comput¬ 
ing.  Likely,  GP-MIMD  will  be  remembered  as  the 
pioneer  of  viable  standards  for  parallel  computing, 
but  not  for  breakthrough  performance.  The  key  to 
success  will  be  a  consensus  on  standards  and  their 
international  acceptance.  This  is  a  truly  exciting 
endeavor,  similar  to  the  High-Performance  Fortran 
initiative  in  the  United  States.  If  standards  are 
accepted  (it  may  take  a  miracle  for  this  to  happen), 
the  portability  issue  would  be  resolved  once  and 
for  all. 

The  initial  GP-MIMD  architecture  will  consist 
of  1,024  T9000  transputers  connected  by  a  multi¬ 
stage  network.  A  prototype  should  be  ready  at  this 
time,  and  commercial  availability  is  expected  in 
late  1993.  This  is  a  truly  aggressive  schedule,  and 
it  remains  to  be  seen  whether  it  can  be  met.  If  the 
goal  of  scalability  in  both  hardware  and  software 
prototypes  is  attained,  systems  of  one  or  two  or¬ 
ders  of  magnitude  iarger  than  those  currently  avail¬ 
able  could  be  built. 

Architecture 

Major  concerns  of  this  project  are  scalability, 
standardization,  and  portability.  The  overall  goal 
of  scalability  is  to  guarantee  the  linear  growth  of 
the  machine  performance  with  the  number  of  pro¬ 
cessors.  One  of  the  keys  to  scalability  is  the  abili¬ 
ty  to  maintain  constant  communication  bandwidth 
as  the  machine  grows  in  size.  The  GP-MIMD 
organization  is  based  on  a  standard  network  that  is 
a  multistage,  rearrangeable,  nonblocking,  parallel 
co^^>uter  network.  The  byte  mode  communication 
network  is  expected  to  be  able  to  deliver  5  Mbyte/s 
bandwidth  for  a  32-processor  system  and  about 
3  Mbyte/s  for  a  S  12-processor  parallel  computer. 
Simulation  studies  indicate  that  the  Clos  network 
performs  consistently  better  than  a  grid  and  that  it 
supersedes  a  hypercube  for  systems  with  more  than 
128  processors. 


Random  routing  on  this  network  eliminates  hot 
spots,  and  adaptive  routing  promises  further  per¬ 
formance  improvements.  Another  feature,  an  extra 
stage,  boosts  the  performance  (a  similar  phenome¬ 
non  has  been  observed  in  an  omega  network).  The 
processing  mode  should  offer  not  only  high  perfor¬ 
mance— it  should  also  support  elTicient  re.<;chedul- 
ing  and  rerouting. 

The  Inmos  T9000  processor  will  offer  200 
MIPS  peak  (70  MIPS  sustained),  25  MFLOPS 
peak  (15  MFLOPS  sustained),  80  Mbyte/s  total 
link  bandwidth,  and  16  Kbyte  cache.  Initial  con¬ 
figurations  use  T800,  i860,  and  Sparc  processors; 
prototype  systems  are  available  now.  The  key  goal 
of  the  design  is  to  make  the  system  architecture  as 
independent  as  possible  so  processors  can  be  "plug¬ 
gable"  to  ensure  "catching  up"  with  technological 
advances.  (This  means  that  if  a  new  generation  of 
microprocessors  becomes  available,  they  can  easily 
replace  r^xisting  ones  (can  be  "plugged  in")  without 
the  necessity  of  redesigning  the  structure.) 

Software 

The  GP-MIMD’s  software  configuration  sup¬ 
ports  a  full  standard  UNIX  operating  system,  three 
languages  (C,  C-f- -f ,  and  Fortran  including  debug¬ 
gers  and  performance  monitors),  and  extensive 
off-the-shelf  I/O  facilities.  The  GP-MIMD  archi¬ 
tecture  will  be  a  platform  for  GP-MIMD  UBIK/ 

ASI  parallel  programming  environments,  which 
will  provide  the  "magic  glue"  to  support  a  wide 
spectrum  of  parallel  programming  tools  and  pack¬ 
ages. 

Adei.]uate  support  for  off-line  storage  and 
interactive  graphics  (visualization)  will  also  be 
provided.  Overheads  for  process  creation,  config¬ 
uration  synchronization,  and  communication  will 
be  constant  or  scalable  with  the  system  size.  They 
will  initially  be  implemented  as  libraries  for  C, 
C-H-,  and  Fortran  to  make  them  portable  across  a 
variety  of  operating  systems. 

Conclusions 

GP-MIMD  is  one  of  Europe’s  most  exciting 
entries  to  parallel  computing.  Its  unique  feature, 

"a  pluggable  processor"  capability,  would  eliminate 
a  common  problem  with  parallel  conqiuters.  In  the 
past,  most  of  the  research  parallel  machines,  once 
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completed,  were  obsolete  and  offered  unattractive 
performance,  sinq>ly  because  the  progress  in  pro¬ 
cessor  technology  was  so  fast  that  by  the  time  a 
parallel  machine  was  ready,  its  processing  elements 
were  outdated.  GP4fl[MD  promises  to  eliminate 
this  problem. 

The  GP-MIMD  push  for  standardization, 
portability,  and  scalability  should  be  commended. 

If  these  requirements  are  met  only  in  part,  they 
still  will  be  a  great  contribution  to  the  parallel 
computing  community  and  will  accelerate  commer¬ 
cialization. 

An  important  test  of  GP-MIMD  success  wilt 
be  the  intermediate  goal  of  designing  and  manufac¬ 
turing  the  proposed  cost-effective,  64-processor 
workstation  whose  performance  should  supersede 
that  of  any  of  the  state-of-the-art  products  (from 
companies  such  as  Hewlett-Packard,  SUN,  IBM, 
and  Digital). 

Members  of  the  GP-MIMD  consortium  are 
interested  in  international  cooperation  to  achieve 
agreement  of  world-wide  stan^ds.  I  think  it 
would  be  useful  for  the  U.S.  research  and  develop¬ 
ment  community  to  initiate  a  full-scale  cooperation 
in  thi^  area. 

For  further  information  contact: 

David  May 
Inmos  Ltd. 

1000  Aztec  West 
Almondsbury,  Bristol 
BS12  4SQ 
United  Kingdom 

GENESIS~THE  EUROPEAN  MIMD 
SUPERCOMPUTER 

GBNESIS  is  a  European  MIMD  Supercom¬ 
puter  Superconq>uter  project  led  by  Prof  W.  Giloi 
at  the  Geselschaft  fuer  Mathematik  und  Daten- 
verarbeitung  in  Berlin.*  The  first  part  of  this 
project  was  supported  by  ESPRIT;  further  support, 
especially  for  software,  is  e'tpected  from  a  Japa¬ 
nese  Real  World  Conq)Uting  program. 

The  GENESIS  gruiq>  has  extensive  experience 
gained  on  the  SUPRENUM  project!’  and  is  capable 
of  bringing  new  insights  and  solutions  to  problems 
of  parallel  commuting,  virtualization,  and  scalabil¬ 
ity.  GENESIS’  salient  features  include:  powerful 
processing  nodes,  "crossbar  of  clusters"  intercon¬ 


nection,  and  virtual  shared  memory  (also  called 
shared  virtual  memory  by  some  researchers). 
GENESIS  is  a  solid  effort  in  parallel  computing;  it 
should  result  in  a  competitive  computation  platform 
for  experimenting  with  Fortran  applications,  mem¬ 
ory  architectures,  and  software. 

Architecture 

The  GENESIS  building  block  is  a  node. 

There  are  two  versions  of  nodes— Version.  1  and 
Version.2.  Version.  1  has  four  major  components: 
Application  Processor  (AD),  Communication  Pro¬ 
cessor  (CP),  Local  Communicatin  Interface  (LCI), 
and  a  Network  Link  Interface  (NLI).  Novel  fea¬ 
tures  of  Version.  1  include: 

•  "Pluggable"  processor;  this  is  crucial  be¬ 
cause  of  fast-changing  microprocessor  tech¬ 
nology  (as  in  GP-MIMD).  Many  parallel 
computer  projects  in  the  past  have  suffered 
from  the  fact  that  by  the  time  the  entire 
system  was  prototyped,  its  microprocessors 
were  obsolete. 

•  Memory  interface  is  a  3-stage  pipeline. 

Intel  microprocessors  (i860  series)  are  used  as 
APs  and  CPs;  memory  consists  of  four  banks  of 
S-Mbyte  DRAMs  (dynamic  random-access  memo¬ 
ry)  per  node.  The  NLI  uses  wormhole  routing  and 
provides  4-byte-wide  bidirectional  links  capable  of 
50  Mbyte/s  transfers  on  each  link. 

The  Version.2  node  is  enhanced  by  a  vector 
processor.  The  BIT  B2 110/20  floating-point  pro¬ 
cessors  (33  MHz)  are  used  in  the  vector  processor 
design,  which  has  been  prototyped  and  tested.  In 
Version.2  a  vector  processor  doubles  the  perfor¬ 
mance  for  Linpack  and  Lawrence  Livermore  Loop 
benchmarks,  with  hardware  cost  overhead  of  50-^ 
percent  over  the  Version.!  node. 

A  two-level  interconnection  structure  is  used  in 
GENESIS  to  connect  processing  nodes. Each 
cluster  has  two  crossbars;  one  to  connect  the 
nodes  within  a  cluster:  another  to  provide  links  to 
all  other  clusters.  This  results  in  a  2-link  require¬ 
ment  per  processing  node.  This  obviously  is  more 
attractive  than  in,  for  exanqile,  a  hypercube  where 
the  number  of  links  is  0(log  n),  n  teing  the  number 
of  nodes.  A  2S6-processor  prototype  would  re¬ 
quire  two  levels  of  16x16  crossbars. 
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Software 

The  operating  system  called  NOS  (node  oper¬ 
ating  system)  is  a  refined  version  of  SUPRE- 
NUM’s  PEACE  (Program  Extension  And  Commu¬ 
nication  Environment).’^  The  main  goais  are 
resource  management  of  each  node  and  efficient 
communication  support  among  them.  NOS  also 
supports  scalability,  an  arbitrary  distribution  of 
global  services,  and  virtual  memory  with  demand 
paging.  The  process  model  of  connecting  light¬ 
weight  processes  into  teams  and  leagues  is  hierar¬ 
chical.  Send-receive-reply  sequences  are  used  to 
support  message  passing,  and  system  services  are 
invoked  by  remote  procedure  cdls.  Data  objects 
are  transferred  by  a  mechanism  called  high-volume 
data  transfer,  which  uses  a  DMA  (direct  memory 
access)  address  generator  once  a  link  is  established. 
The  DMA  generator  is  part  of  a  CP;  it  can  be 
assigned  to  address  elements  of  any  data  structure 
in  any  desired  order.  This  capability  is  useful  in 
high-volume  data  processing  such  as  large  vectors 
and  matrices. 

Support  software  for  partitioning  the  applica¬ 
tion  programs  in  cooperating  tasks  has  been  devel¬ 
oped.  The  operating  system  is  also  capable  of 
automatic  distribution  of  cooperating  tasks  over  the 
nodes.  Regular  grid  problems,  grid  generators, 
communication  routines,  and  standard  functions/ 
algorithms  can  be  preprogrammed  and  are  avail¬ 
able  to  an  application  programmer  in  the  form  of 
library  routines;  linear  algebra  computations  and 
multigrid  PDE  (partial  differential  equation)  solvers 
are  available. 

GENESIS  tools  include 

•  a  co^^)iler  for  Fortran-90; 

•  simulators  for  program  development  and 
verification  that  run  on  conventional  UNIX 
machines; 

•  a  parallel  debugger  for  debugging  Fortran 
programs; 

•  performance  analysis  and  visualization  tools 
for  rq>lay  of  task  activities; 

•  protocol  timing;  and 

•  statistical  data  on  processor  utilization  and 
communication. 

The  GENESIS  execution  environment  provides 
UNIX-like  user  interface.  Current  efforts  focus  on 


developing  the  Virtual  Shared  Memory  Architec¬ 
ture  to  free  programmers  from  partitioning,  distrib¬ 
uting,  and  communicating  cooperative  tasks.  In 
other  words,  this  is  yet  another  effort  toward  de¬ 
veloping  a  parallelizing  compiler  (for  which  GEN¬ 
ESIS  architecture  could  provide  a  good  experimen¬ 
tal  platform). 

Conclusions 

GENESIS  is  pragmatic;  it  is  based  on  funda¬ 
mental  principles  for  parallel  architectures.  Its 
potential  success  lies  in  demonstrating  global  ad¬ 
dress  space  implementation  in  a  message-passing 
environment.  If  the  Virtual  Shared  Memory  Ar¬ 
chitecture  is  scalable  and  portable,  it  will  have  a 
strong  impact  on  the  next  generation  of  parallel 
computers. 

For  further  information  contact: 

Wolfgang  K.  Giloi 
GMD  Research  Center  for 
Innovative  Computer  Systems 
and  Technology 
Technical  University  of  Berlin 
Hardenbergplatz  2 
D-1000  Berlin  12 
Federal  Republic  of  Germany 

SPECIAL-PURPOSE  PARALLEL 
COMPUTERS 

APEIOO-A  lOO-GIGAFLOPS 
Parallel  Computer 

Synopsis 

Twelve  physicists  (turned  conq)uter  scientists 
and  engineers)  at  the  Institute  of  Nuclear  Physics, 
University  of  Rome,  are  developing  an  SIMD 
quantum  chromo  dynamics  KXl^FLOPS  comput¬ 
er.  The  APEKX)  is  a  followup  to  the  APEl, 
whose  peak  performance  is  around  one  GFLOPS. 
The  new  machine  has  a  3D  lattice  organization 
with  8x16x16  (2K)  custom-designed  processors. 
Each  node  is  connected  to  its  six  members  (two  in 
each  dimension)  with  wraparound  connections  on 
the  boundaries. 

Key  elements  are  a  64-MFLOPS  processor 
(10-stage  pipelined  floating-point  multiplier  and 
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adder— somewhat  similar  to  Intel’s  i860)  and  a 
smart  optimizing  compiler.  This  construction 
virtually  squeezes  out  every  cycle  of  available 
performance. 


connected  through  speciflc  interfaces  and  connected 
to  a  controller.  A  key  element  of  each  processing 
unit  is  the  floating-point  processor,  which  is  a  1.2- 
/tm  CMOS  multiplier  and  adder  (MAD)  device.  A 


Tables  1  and  2  provide  salient  characteristics 
of  the  APE  1(X)  architecture  packaging  and  perfor¬ 
mance.  With  $10  million  funding,  this  is  one  of 
Europe’s  most  interesting  projects. 

Arrhitecture 

The  APEIOO  is  a  fme  grain,  modular  SIMD 
machine  that  can  be  subdivided  into  two  layers: 

1.  A  synchronous  kernel— an  SIMD  parallel 
computer  driven  by  a  synchronous  ma¬ 
chine  called  S-CPU  (synchronous  central 
processing  unit) 

2.  An  asynchronous  shell— the  host  computer 
and  the  asynchronous  interface,  which  is 
conq)Osed  of  a  network  of  transputers. 

A  parallel  conputer  is  an  assembly  of  customized 
processing  units  configured  as  a  3D-lattice  inter- 


_ Table  1— APEIOO  at  a  Glance 

•  Three-dimensional  topology 

•  Fine-grained  modular  SIMD  architecture 

(from  8  to  4096  nodes) 

•  4  MBytes,  128  registers  and  SO 

MFLOPS  per  node 

•  Hardware  support  for  local  (non-SIMD) 

conditional  structures 

•  Centralized  address  and  integer 

conputations 

•  Simultaneous  RAM  to  register  transfers 

and  arithmetic  caiculations 

•  25  MHz  master  clock 

•  8  nodes  (400  MFLOPS)  per  PB 

•  1  to  32  crates,  each  containing  16  pro¬ 

cessing  boards  and  1  controller 


Table  2— APEKX)  Architecture 


FLOATING  POINT  PERFORMANCES 

1  Processing  board 

400  MFLOPS 

1  Crate 

6.4  GFLOPS 

Maximum  configuration 

200  GFLOPS 

DATA  MEMORY  SIZE 

1  Processing  board 

32  MBytes 

1  Crate 

S12  MBytes 

Maximum  configuration 

16  Gbytes 

RAM  TO  REGISTER  TRANSFER  RATE 

1  Node 

50 

MBytes/s 

Global  transfer  rate 

200 

(max.  conf.) 

GBytes/s 

DISK  STORAGE 

1  Crate 

600  MBytes 

Maximum  configuration 

20  GBytes 

MEMORY-TO-DISK  TRANSFER  DATA 

1  Crate 

300 

KBytes/s 

Maximum  configuration 

9.6 

MBytes/s 

Full  memory  backup 

-30  min. 

CONTROL  PERFORMANCES 

Synchronous  CPU 

24  MIPS 

Asynchronous  interface 

2  MIPS 

Synchronous  program 

128  K  Instructions 

memory  size 

0.5  MBytes 

S-CPU  data  memory 

LAI  memory 

1  MBytes 

POWER  CONSUMPTION  | 

1  Crate 

-2kW 

Maximum  configuration 

~64kW 

COOLING  SYSTEM 

Forced  ventilation,  ambient  tenperature  and 
humidity  controlled. 
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SO-MFLOPS  MAD  has  an  adder  and  a  multiplier 
based  on  real  arithmetics.  Additionally,  MAD 
contains  128  32-bit  register  space,  which  is  much 
larger  than  in  a  typical  processor  support  circuitry. 
MAD  can  complete  one  floating-point  addition,  one 
floating-point  multiplication,and  one  I/O  operation 
in  each  40-ns  cycle  MAD  contains  backup  tables 
for  approximating  frequently  used  functions  such 
as  1/x,  log(x),  l/sqrt(x),  exp(x).  MAD  is  hard¬ 
wired  to  execute  D=A»(+B)+Cso  that  three 
specific  operations  can  be  executed  by  loading 
appropriate  registers  with  values  0  or  1  such  that 

ADD/SUB  is  D=1  *  (-l-B)+C 

MPY  is  D=A  *  (+B)+0 

NO-OPERATION  is  D*0  *  0  0. 

A  stack  allows  for  nesting  up  to  eight  conditional 
structures  for  elementary  conditions  such  as 
EQ.zero,  GT.zero,  GE.zero,  and  NE.zero.  The 
evaluation  is  derived  from  Boolean  AND,  OR, 

NOT  operators. 

A  processing  node  comprises  one  MAD  and 
four  MByte  of  local  memory.  Eight  nodes  form 
a  processing  board  (PB)  and  are  assembled  on  a 
single  printed-circuit  board  (PCB).  For  example, 
a  2K-node  machine  would  require  2S6  PBs  (16 
crates)  and  would  have  a  potential  of  delivering 
lOO-GFLOPS  peak  performance;  for  a  sustained 
lOO-GFLOPS  performance,  4K  nodes  would  be 
required.  Each  crate  (16  PBs)  contains  one  central 
processor,  called  S-CPU,  that  synchronously  exe¬ 
cutes  a  program  that  is  simultaneously  distributed 
to  128  processing  nodes. 

The  design  of  the  S-CPU  is  based  on  a  proces¬ 
sor  developed  at  CERN.  It  is  a  synchronous, 

AMD  29000  series-based,  3-stage  pipeline  integer 
processor  operating  at  an  80-ns  cycle.  The  S-CPU 
controls  program  flow  and  executes  IF  and 
IF_ALL  conditional  structures. 

The  host  computer  (VAX  workstation)  is 
connected  via  a  transputer-based  root  asynchronous 
interface  (RAI)  and  numerous  local  asynchronous ; 
interfaces  (LAIs),  to  processing  nodes.  The  local 
S-CPU  and  the  associated  LAI  transputer  form  a 
controller  unit  that  is  housed  on  a  separate  PCB. 
Global  data  are  distributed  via  controllers;  local 
data  reside  in  MAD  registers  and  node  memories. 


Data  and  programs  are  downloaded  from  a  disk. 
Experience  on  APE  shows  that  the  required  disk 
space  should  be  one  to  three  times  the  RAM  space 
for  an  efficient  execution  of  quantum  chromo 
dynamics  (QCD)  programs.  The  main  storage  of 
10-20  GBytes  should  be  sufficient.  A  back-up 
copy  of  the  whole  system  takes  about  30  minutes  at 
a  global  transfer  rate  of  5  MBytes/s. 

Although  the  APE-lOO’s  hardware  is  QCD-ori- 
ented,  it  performs  well  for  many  other  applica¬ 
tions.  The  power  of  this  architecture  lies  in  its 
simplicity. 

APEIOO  Software 

APE  100  software  contains  standard  software 
for  the  host  computer,  a  compiler  chain,  interface 
package,  and  customized  operating  system  with  a 
symbolic  debugger. 

The  compiler  chain  includes  an  Apese  compil¬ 
er  and  an  optimizer.  Apese  is  a  structured  lan¬ 
guage  based  on  Fortran  but  without  its  idiosyncra¬ 
sies.  Its  extensions  support  SIMD  architecture  of 
APE  and  provide  for  closer  control  of  the  data  in 
registers  and  memories.  The  first  step  in  the  com¬ 
piler  chain  produces  an  intermediate  code  that  is 
then  optimized  in  the  second  step  by  the  optimizer. 
The  optimizer  concentrates  on  floating-point  opera¬ 
tions  to  maximize  the  APE’s  performance.  First, 
it  tries  to  combine  additions  and  multiplications 
into  so-called  NORMAL  operations  (i.e.,  A*B-l-C) 
for  which  MAD  architecture  is  optimized.  The 
optimizer  then  arranges  the  codes  such  that  the 
total  number  of  machine  cycles  is  minimized.  The 
only  limits  on  this  optimization  are  hardware  and 
operands  availability  constraints.  The  third  phase 
consists  of  register  allocation.  Executable  code  is 
then  produced. 

The  APE’S  operating  system  (HACK  -  host 
resident  APE  control  kernel)  executes  on  the  host 
computer  and  the  synchronous  interfaces.  In  addi¬ 
tion  to  standard  operating  system  functions  such  as 
user  interface,  I/O,  loading,  program  execution 
status  monitoring,  and  memory  management,  it 
includes  the  symbolic  debugger.  This  debugger  is 
capable  of  swapping  different  tasks,  executing 
single  or  multiple  selected  steps,  and  reading  any 
memory  or  register  contents. 
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Condusious 

The  APEtOO  !ias  the  potential  of  becotnint^  the 
world's  fastest  QCD  machine.  Its  power  lies  in 
focus  on  a  specific  application  and  simplicity.  The 
application  is  well  understood  and  led  by  world- 
class  physicist  with  advice  from  two  Nobel  laure¬ 
ates.  The  system’s  configuration  of  4096  nodes 
may  experience  some  difficulties  because  of  tradi¬ 
tional  problems  such  as  power  supply,  clocking, 
and  cooling. 

For  further  information  contact; 

Nicole  Cabibbo 
Gruppo  APE 
Dipartamento  di  Fisica 
Universita  di  Roma  "La  Sapienza" 

Piazzale  Aldo  Moro  2 
00185  Roma 
Italy 

ASSOCIATIVE  STRING  PROCESSOR 
Synopnis 

The  faculty  of  Brunei  University  (U.K.)  and 
an  on-campus  company  (Aspex  Microsystems)  are 
developing  an  associative  computer  architecture 
suited  for  multichip  and  wafer-scale  integration 
(WSI)  technology.  Professor  M.  Lea,  project 
director,  has  been  pursuing  his  ideas  of  Associative 
String  Processor  (ASP)  for  the  last  15  years;  it 
appears  that  his  determination  and  persistence  will 
pay  off. 

Key  elements  of  ASP  architecture  include 

•  content-addressing  capability  that  eliminates 
the  need  for  processor  addressing; 

•  a  unique,  dynamically  reconfigurable  net¬ 
work  that  minimizes  data  transfer  and  pro¬ 
vides  fault  tolerance; 

•  low  power;  and 

•  high  packing  density. 

This  architecture  is  ideally  suited  for  real-time 
signal  processing.  It  has  already  been  installed  at 
CERN  in  Geneva,  Fermi  Ijsboratories  in  Chicago, 
and  other  locations.  It  demonstrates  clear  superior¬ 
ity  in  high-energy  physics  and  image-underst^ing 
benchmarks. 


Architecture 

The  Associative  String  Processor  is  a  string  of 
content-addressable  l>bit  processing  elements  con¬ 
nected  through  a  hierarchy  of  bus-like  networks. 
Their  organization  supports  data-levei  parallelism. 
Content  addressing  results  in  processor  address 
elimination,  natural  scalability,  and  fault  tolerance. 
The  data  structures  are  wrapped  onto  a  string  that 
is  both  simple  and  efficient.  Avoidance  of  data 
addressing  and  minimization  of  data  movement 
maximizes  the  performance  for  a  number  of  appli¬ 
cations. 

The  Associative  Processing  Elements  (APEs) 
are  connected  through  a  chordal  ring  network  that 
allows  efficient  transfer  among  neighboring  ele¬ 
ments.  For  applications  that  require  more  diverse 
data  exchanges,  this  network  would  not  provide 
adequate  support,  but  it  remains  to  be  seen  how 
many  applications  require  an  arbitrary  interconnec¬ 
tion  pattern.  A  regular  interconnection  pattern 
allows  high  packing  density.  This  regularity  and 
low  power  requirements  make  this  architecture 
well  suited  for  WSI  implementation. 

The  organization  of  APE  supports  associative 
processing  in  which  a  processing  element  is  acti¬ 
vated  if  it  matches  a  particular  flag  for  a  given 
stage  of  computation.  The  layers  of  chordal  ring 
network  conrect  blocks  of  four  APEs.  Each  block 
is  then  provided  with  a  bypass  and  additional  lay¬ 
ers  of  bypas!.  networks  can  connect  every  16th, 
64th,  etc.  processor.  Depending  on  a  particular 
communication  pattern,  various  levels  of  bypass 
may  be  activated.  If  some  elements  of  a  string 
are  faulty,  they  can  be  permanently  bypassed  by 
appropriately  setting  the  network. 

liie  ASP  systems  are  packaged  by  using  con¬ 
ventional  and  multichip  technologies  (hybrid-ASP) 
and  monolithic  wafers  (monoIithic-WSI). 
Hybrid-ASP  prototypes  include  2D-packages: 
HASP-1  with  1,024  processors;  HASP-2  with 
4,096  processors;  and  HASP-3  with  8,192  process¬ 
ing  elements.  A  3D  HASP  is  under  preliminary 
investigation.  The  monolithic-WSI  development 
includes  several  prototype  WASP  l/2a/2b  with 
720/864/6480  processors;  WASP  3/4/5  with 
8,192/16,384  processors;  and  WASP-6  with 
16,384/32,768  processing  elements.  These  sys¬ 
tems  have  been  developed  in  cooperation  with 
Piessey,  Hughes,  General  Electric,  and  CERN. 


488 


*:8NIB  02-Oe 


The  largest  ones,  with  16K/32K  processing  ele¬ 
ments,  are  tested  at  Rome  Laborator>',  Griffis  Air 
Force  Base,  New  York. 

Current  yield  of  fault-free  processors  on  a 
wafer  is  about  SO  percent  For  example,  an  8K 
processor  system  is  capable  of  delivering  14-Giga 
OPS  (12-bit  words)  peak  performance,  by  using  a 
40  MHz  clock,  12  W  of  power,  and  1-micron  (fan) 
technoiogy. 

Software 

ASP  has  its  own  specially  designed  instruction 
set,  tailored  to  associative  processing.  The  operat¬ 
ing  environment  consists  of  three  levels  of  control¬ 
lers  and  a  string  of  APEs.  The  high-level  control¬ 
ler  is  a  SUN  workstation,  and  programmers  can 
use  C,  Pascal,  and  Ada  languages  for  developing 
their  applications. 

Several  applications  have  already  been  devel¬ 
oped.  Abingdon  Cross  and  CERN  high-energy 
physics  benchmark  indicate  that  the  ASP  outper¬ 
forms  most  conqieting  architecPires.  CERN’s 
benchmark  is  processed  at  almost  one  billion  pix¬ 
els/s.  Also  with  respect  to  the  Defense  Advance 
Project  Agency’s  (DARPA’s)  image-understanding 
benchmarks,  ASP  architecture  comes  out  ahead. 
Most  impressive  is  the  back-propagation  model 
benchmark  for  neural  computing  where  the  ASP 
outperfonns  Cray  XMP  by  a  factor  of  7.  Other 
applications  that  have  been  impiemented  include 
control,  signal,  image,  and  data  processing.  Table 
3  shows  expected  performance  of  64K  ASP  for  two 
substrates. 


Conclusions 

The  ASP  is  not  just  a  design.  It  is  a  real, 
existing  product  with  unique  architecture  that  has 
been  incorporated  into  military  and  commercial 
systems.  It  is  one  of  the  pioneering  WSI  efforts; 
for  some  applications  the  ASP  delivers  unmatched 
performance.  It  may  unquestionably  become  one 
of  Europe’s  most  interesting  projects  if  program¬ 
ming  issues  are  resolved.  The  real  challenge  here 
is  software  and  its  portability. 

For  further  information  contact; 

Michael  Lea 

ASPEX  Microsystems  Ltd. 

Brunei  University 

Uxbridge  UBS  3PH 

United  Kingdom 


THE  DISTRIBUTED  ARRAY  PROCESSOR 

The  Distributed  Array  Processors  (DAP)  series 
from  Active  Memory  Technology  (AMT),  Read¬ 
ing,  U.K.,  came  from  an  .\nglo-American  start-up 
that  is  a  formidable  competitor  to  Connection 
Machines. 

This  section  has  a  slightly  different  format;  it 
includes  a  brief  history  of  the  evolution  of  DAPS 
that  is  not  well-known  in  the  U.S.,  as  well  as  a 
short  description  of  the  DAP  series  and  their  per¬ 
formance.  This  is  followed  by  a  note  on  the  Star¬ 
light  Module,  which  might  become  the  world’s 
smallest  gigaFLOPS  computer. 


Table  3— Expected  Perfonriance  of  64K  Processor  ASP 


Configuration 

I  or  2  MIMD  processors 

Performance 

100  Giga-OPS  (12-bit  adds  at  40  MHz) 

Ii^ut-output  bandrvidth 

640  MBytes/s  (at  20  MHz) 

Number  of  processors 

65,536  (fault-tolerant) 

Package  size 

6.4  X  5.88  X  0.3  in.  (1/2  SEM-E) 

Power  dissipation 

<  lOOW 

Market  cost 

<  $100,000 

i 
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From  ICL  to  AMT 

First  reports  on  square  arrays  of  1-bit  proces¬ 
sors  were  published  in  1972-76,  mainly  by  Interna¬ 
tional  Computers  Ltd.  (ICL),  then  the  largest 
British  computer  company.  International  Comput¬ 
ers  Ltd.  is  currently  owned  by  Fujitsu.  In  1976  the 
first  array  with  1024  1-bit  processing  elements 
(PE)  was  introduced  as  a  front-end  computer  to  the 
ICL  1900  series.  In  1980  a  4096  FE  array  was 
installed  in  the  ICL  2900  series.  Although  the 
machine  was  a  commercial  failure  (only  six  of 
them  were  manufactured),  it  created  an  explosion 
of  interest  in  academic  circles,  which  resulted  in 
more  than  1,000  publications.  The  second  genera¬ 
tion  of  DAPs  appeared  five  years  later  in  the  ICL’s 
minicomputer  (called  PERQ).  But  again,  hand¬ 
crafted  technology,  lack  of  software,  and  low 
reliability  resulted  in  only  13  copies  of  the  machine 
being  made.  Those  were  distributed  mainly  to 
academic  and  research  laboratories. 

In  1985/86  ICL  decided  to  spin  off  the  DAP 
technology.  In  October  1986,  Active  Memory 


Technology  Ltd.  was  formed  with  £4.2  million 
startup  capital;  25  percent  from  ICL,  SO  percent 
from  venture  capitalists,  and  25  percent  from  staff 
and  founders.  A  core  team  came  from  ICL.  After 
two  years  of  intensive  design  efforts,  production 
and  shipment  of  the  first  DAPs  began  in  1988. 

Today,  AMT  has  a  staff  of  90  (50  percent 
U.K.,  50  percent  U.S.)  and  more  than  80  systems 
have  been  installed  in  7  countries.  Their  strategy 
is  to  concentrate  on  European  and  U.S.  markets. 
Overall  system  design  and  software  are  developed 
in  the  U.K.;  VLSI  design,  system  integration,  and 
manufacturing  are  done  in  the  U.S. 


DISTRIBUTED  ARRAY  PROCESSORS 

AMT  currently  offers  four  models  of  DAPS: 
510,  610,  5!0c,  and  610c.  Table  4  lists  their  basic 
parameters. 

In  principle,  DAP  is  a  single-instruction  multi¬ 
ple-data  computer  (SIMD),  which  would  imply 
limited  applicability.  However,  it  is  shown  later 


Table  4  -  Summary  of  DAP  Performance 


Model 

510 

610 

510c 

610c 

Clock  Rate 

10  MHz 

10  MHz 

10  MHz 

10  MHz 

Processors 

1  bit 

1024 

4096 

1024 

4096 

8  bit 

- 

- 

1024 

4096 

Peak  Perfonnance 

1  bit  MIPS 

10,000 

40,000 

10,000 

40,000 

8  bit  MIPS 

400 

1,600 

5,000 

20,000 

32  bit  MLFLOPS 

15 

60 

140 

560 

I/O  MBytes/s 

40-»-40 

80-1-80 

40-1-40 

80+80 

Processor  Use  for  I/O 

3% 

1.5% 

3% 

1.5% 

Relative  Power 

1 

4 

10 

40 

Price  Performance 

8,000 

5,000 

1,150 

700 

$/MFLOPS 
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that  it  can  tackle  an  amazing  spectrum  of  problems 
with  surprising  efficiency. 

Basic  architecture  of  a  DAP  computer  com¬ 
prises  a  32x32  =  1024  array  of  1-bit  PEs  for  the 
510  models,  and  a  64  x  64  «  4096  array  of  1-bit 
PEs  for  610  models.  In  addition  to  1-bit  PEs, 
models  SlOc  and  610c  have  8-bit  PEs  to  enhance 
floating-point  computation.  Each  processor  ele¬ 
ment  is  connected  to  its  four  neighbors,  and  a  bus 
.system  connector  processes  by  rows  and  by  col¬ 
umns.  Each  processor  is  connected  to  a  minimum 
of  its  own  32  Kbits  of  local  memory,  which  can  be 
extended  to  IMbits.  This  gives  a  memory  range  of 
between  4  and  512  Mbytes.  The  processor  array  is 
controlled  by  a  master  control  unit,  which  inter¬ 
prets  a  program  located  in  its  private  memory  and 
broadcasts  decoded  instructions  'o  the  entire  pro¬ 
cessor  arrays.  Then  PEs  concurrently  execute 
these  instructions,  processing  the  data  in  their  local 
memories.  Even  with  the  10  MHz  clock,  the  DAP 
architecture  in  a  4096  PE  configuration  is  capable 
of  delivering  more  than  40  Gbit  operations/second 
(e.g.,  logic  operations).  Also,  a  maximum  aggre¬ 
gate  data  rate  between  processors  and  their  memo¬ 
ries  may  reach  5.12  Gbytes  per  second. 

A  SUN  or  VAX  workstation  serves  as  DAP’s 
host  and  is  connected  to  the  master  control  unit. 
This  provides  a  highly  inte.’active  environment  and 
interface  to  a  wide  range  of  peripherals.  Although 
input/output  via  the  host  may  be  sufficient  for 
some  applications,  there  is  a  capability  for  direct 
I/O  to  the  DAP  via  fast,  40  Mbyte/s  bidirectional 
data  channels.  This  does  not  signific.antly  affect 
array  performance  (3  percent  degradation  for  DAP 
510  and  0.8  percent  for  DAP  610).  There  is  also  a 
VME'bus  interface  to  a  host  connection  unit. 

DAP’s  software  from  a  programmer’s  point  of 
view  can  be  grouped  into  two  sets; 

1.  Host  machine  software,  which  is  a  standard 
set  (UNIX,  tools,  libraries,  etc.),  and 

2.  Array  software,  which  is  written  in  a 
standard  extended  language  (such  as 
Fortran-Plus  or  assembler)  and  exploits 
the  parallel  capabilities  of  DAPs. 

Fortran-Plus  is  similar  to  the  recent  standard, 
Fortran-90,  and  enables  array  manipulation  of  any 
size.  The  Array  of  Processors  Assembly  Language 


(APAL)  is  a  macroassembler  that  provides  com¬ 
plete  control  of  all  array  elements.  A  hybrid  of 
APAL  and  Fortran-Plus  is  feasible;  integrating 
APAL  into  Fortran-Plus  may  accelerate  execution 
of  some  time-consuming  operations  in  a  given 
application. 

Application-support  libraries  include  a  wide 
range  of  functions  to  manipulate  matrices  and 
vector  mathematical  routines.  It  also  has  graphics, 
image  processing,  digital  signal  processing,  and 
windowing  routines.  In  addition,  there  is  a  DAP 
simulator,  which  is  fully  hardware  compatible.  It 
allows  users  who  do  not  have  access  to  DAP  com¬ 
puters  to  develop  and  test  DAP  programs  off-line. 
Program  State  Analysis  Mode  (PSAM)  is  an  inter¬ 
active  debugger  for  Fortran-Plus  and  APAL  pro¬ 
grammers. 

Performance 

DAPs  are  especially  good  in  a  naturally 
parallelizable  applications.  Table  S  describes  the 
performance  of  some  typical  operations. 

Performance  of  exact  and  partial  matches  with 
estimation  of  statistical  significance  is  measured  in 
millions  of  cell  updates  per  record  (Meups).  For 
example,  DAP  610  with  Fortran  code  can  execute 
28  Meups;  with  assembler  it  can  execute  84 
Meups.  A  Connection  Machine  CM2  with  32K 
PEs  and  microcoded  loops  can  execute  25  Meups. 
Also,  for  some  specific  applications,  when  com¬ 
pared  with  respect  to  price/performance  to  IBM’s 
3090V,  Floating-Point  Systems’  264  machine,  and 
Cray’s  XMP,  DAPs  seem  to  be  very  good  (Table 
6). 

With  more  than  80  systems  sold,  AMT  has  a 
remarkable  presence  at  British  universities  (22). 
some  at  U.S.  universities,  3  at  government  labora¬ 
tories  (2  in  the  U.K.  and  1  at  the  Argonne  National 
Laboratory).  Five  major  British  companies  own 
DAPs,  including  a  news  agency  that  uses  DAP  for 
keyword  searching.  Thirty-six  systems  were  sold 
as  of  August  1990  to  American  companies  and 
government,  including  the  U.S.  Navy  and  the  U.S. 
Army.  About  10  systems  have  been  sold  in  Bel¬ 
gium,  France,  FRG,  Ireland,  and  Japan. 

Applications  cover  a  wide  range.  Image  and 
signal  processing  dominate  in  the  defense  indus¬ 
try;  research  on  parallel  computing  and  medical 
imaging  predominate  in  the  universities.  Other 
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Table  5  —  Performance  of  Typical  DAP  Operations 
Image  size  is  512x512  elements  with  8-bit  pixels 


Time  (ms) 

Time  (ms) 

Model 

510 

610 

3x3  Convolution 

14 

3.5 

Frame-to-firame  differencing  and 
thresholding 

1.5 

0.4 

3x3  median  filter 

15 

3.8 

Histograming  (256  levels) 

40 

10 

Binary  image  thinning 

2 

0.5 

Image  normalization 

3 

0.8 

2-D  FFT 

68 

17 

Image  segmentation 

5 

1.3 

Table  6  —  Price/Performance  Relative  to  Cray  XMP 


DAP 

510 

DAP 

610 

3090V 

FPS 

264 

CRAY 

XMP 

Simple  loops 

■■ 

fl.pt.  32 

4.8 

8.1 

■EH 

1 

fl.pt.  64 

1.6 

3.1 

1 

integer 

31.6 

49.7 

0.3 

0.6 

1 

Livermore 

all 

0.5 

0.9 

1.6 

1 

parallel 

4.4 

5.8 

■EH 

0.5 

1 

log  parallel 

0.3 

2.3 

1 

serial 

0.2 

1.0 

1.6 

1 

Kernels 

■i 

convolutions 

39.6 

55.9 

1 

zero  crossings 

54.0 

67.9 

■^1 

1 

histogram 

5.2 

7.7 

1.1 

1 

comer  find 

14.8 

17.5 

0.6 

0.8 

1 

ESTEC  algoriduns 

■■HI 

CFD 

2.4 

1.7 

1 

LLT 

1.6 

2.4 

1.1 

1 

random 

5.6 

5.3 

1 

edges 

27.6 

45.4 

0.5 

■91 

1 
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applications  include  molecular  physics,  neural 
networks,  molecular  biology,  computer  design 
simulation  (also  circuit  simulation),  finite-element 
codes,  fluid  dynamics,  genetic  sequence  matching, 
data  compression,  radar  processing,  mine  detec¬ 
tion,  human  interface,  and  seismic  processing. 

Starlight  Module 

The  latest  in  the  DAP  series  designs  is  a  multi¬ 
processor  called  Starlight.  It  has  4095  processors, 
20  MHz  clock,  16-Mbyte  data  memory,  4-Mbyte 
instruction  memory,  and  320  Mbytes/second  I/O 
rate.  It  uses  multichip  technology  and  is  packaged 
in  a  small  cylinder,  S  inches  in  diameter  and  2.5 
inches  high.  Starlight  initially  will  deliver  32-bit 
120  MFLOPS,  but  there  are  plans  to  upgrade  it  to 
1.2  GFLOPS,  which  would  make  it  the  world’s 
smallest  GFLOPS  engine.  At  $250  per  MFLOPS 
and  $10  per  MIP,  it  will  also  be  very  cost-effec¬ 
tive. 

Conclusions 

The  Starlight  module  could  become  one  of  the 
most  inexpensive  multiprocessors  among  special- 
purpose  parallel  computers.  It  is  competitive  in 
performance  with  Connection  Machines  while 
presenting  lower  cost  and  superb  packaging  tech¬ 
nology. 

For  more  detailed  information  contact; 

Goeff  Manning 

Active  Memory  Technology  Ltd. 

65  Suttons  Park  Avenue 
Reading  RG6  lAZ 
United  Kingdom 

EUROPEAN  DECLARATIVE  SYSTEM  (EDS) 
^nopris 

The  European  Declarative  System  is  Europe’s 
most  ambitious  project  in  developing  a  database 
engine  for  the  1990s.  The  project  is  carried  out  at 
the  European  Congniter-industry  Research  Centre 
in  Munich,  FRG.  It  is  stiq)port^  by  three  major 
European  companies;  Bull  (Fiance):  International 


Computers  Ltd.  (U.K.,  currently  owned  by 
Fujitsu);  and  Siemens  (FRG).  Additional  support 
is  provided  by  the  European  Community’s  ESPRIT 
program. 

The  EDS  is  capable  of  supporting  parallel 
database  operations  for  a  variety  of  operating 
environments,  including  SQL.  The  minimum 
performance  requirements  aim  at 

•  12,000  transactions  per  second  for  data¬ 
base  applications; 

•  140  Boyer  runs  per  second  for  Lisp;  and 

•  32  M  logical  inferences  per  second  for 
ElipSys,  the  logic  progranuning  system. 

This  diversity  and  richness  of  supported  environ¬ 
ments,  installed  on  multiprocessor  parallel  sys¬ 
tems,  holds  the  promise  of  state-of-the-art  database 
engines  that  will  be  equally  useful  to  researchers, 
p.^actit. oners,  and  users. 

Architecture  and  Software 

The  EDS  prototype  is  a  128-processor  system 
based  on  a  buffered,  multistage  network  and  Sparc 
microprocessors.  The  important  feature  of  this 
architecture  is  "seamless"  support  for  both  shared 
and  message-passing  memories  by  the  use  of  dis¬ 
tributed  shar^  virtual  memory.  This  computer 
contains  the  Parallel  Distributed  Store  Machine, 
which  processes  queries,  and  the  EDS  Machine 
Executive  (EMEX),  which  is  a  :mall  operating 
system  capable  of  concurrency  control,  logging, 
recovery,  and  load  balancing.  It  supports 
deadlock-free  two-phase  locking.  Specific  issues 
related  to  fault  tolerance  (such  as  ejects  of  shad¬ 
owing,  frequency  of  checkpointing,  log  file  update 
policy,  and  synchronization  among  computing 
nodes  holding  fragments  of  the  same  database)  are 
investigated  and  are  being  incorporated  into 
EMEX. 

Lisp  Applications  are  on  top  of  the  Lisp  sub¬ 
system,  which  was  developed  in  a  separate  Europe¬ 
an  Computer-Industry  Research  Centre  [(ECRC), 
Munich,  FRG)]  project  called  the  Knowledge 
Crunching  Machine  (LCM),  is  enable  of  IM  lips 
(millions  of  logical  inferences  per  second)  peak 
performance  on  a  15  MHz  KCM. 
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The  Bl^ys  Logic  Programming  Systeni  inter¬ 
im  the  EDS  machine  with  any  stimd^  SQL- 
based  database  product  (SQL  is  the  most  widely 
accepted  standard  for  database  systems).  This 
focilitates  logic  programming  on  standard  data  that 
use  ECRC-designed  "common  exchange  format." 
Blipsys  iqpplications  include  a  tourist  informa- 
tion/advisory  system  developed  at  the  University  of 
Athens  and  a  bank  treasury  management  support 
system.  The  biggest  commercial  impact  may  come 
from  SQL  support  on  top  of  a  standard  UNIX 
subsystem  that  also  supports  a  variety  of  UNIX,  C, 
and  C4-  +  applications. 

Conclusions 

EDS  is  one  of  the  most  exciting  database 
projects  in  Europe;  it  may  successfully  conq>ete 
with  database  lexers  such  as  Teradata  and  others. 
If  goals  are  met  on  time,  it  may  significantly  affect 
research  and  commerce  for  future  generations  of 
database  machines.  It  will  also  demonstrate  that; 

•  cost-effective  parallel  database  machines 
can  be  built,  and 

•  cooperation  among  large  companies  such 
as  Bull,  ICL,  and  Siemens  in  bringing  a 
successful  product  to  the  market  is  feasi¬ 
ble. 

Richness  and  diversity  of  concepts,  combined  with 
pragmatic,  practical  approaches  to  securing  such 
basic  capabilities  as  SQL  and  UNIX  ^plications, 
may  lead  to  success. 

For  ftirther  information  contact: 

Mike  Reeve 

European  Computer-Industry 
Research  Centre  (ECRC) 

Arabellastrasse  17 
D-8000.  Munich  81 
Federal  Rq>ublic  of  Germany 

SFECIAL-PURFOSE  TRANSPUTER  SYSTEMS 

S^jfnopsis 

Transpiuers  are  ideal  building  blocks  to  config¬ 
ure  special-purpose  systems  that  are  tailored  to 
Beloit  execution  of  a  particular  q>plication.  The 


special-purpose  systems  briefly  described  here 
illustrate  tMs  flexibility  as  well  as  demonstrate  a 
diversity  of  applications. 

Examfdes  of  Spedal-Purpose  Systems 

A  43-transputer  system  has  been  developed  at 
Volkswagen  Research.  It  is  a  driving  simulator 
that  is  capable  of  simulating  the  behavior  of 
brakes,  steering,  and  suspension  systems.  The 
performance  of  600  polygons  at  25  frames/s  is  still 
far  behind  state-of-the-art  simulators  from  Evans 
and  Sutherland  (U.S.),  but  the  price  is  orders  of 
magnitude  lower.  This  makes  the  transputer  sys¬ 
tem  an  attractive  alternative. 

There  is  a  wide  range  of  military  applications 
of  transputers;  the  British  Ministry  of  Defence 
(MOD)  started  the  parallel  processing  program  in 
the  early  1980s.  The  first  parallel  machines  were 
based  on  transputers  because  of  their  cost-effective¬ 
ness,  extensibility,  and  reconfigurability.  Applica¬ 
tion  areas  include:  embedded  real-time  control 
systems,  image  processing,  speech  recognition, 
battlefield  simulation,  missile  system,  and  terrain 
line  of  sight  modeling.  Three  levels  of  parallel 
systems  are  distinguished,  based  on  the  application: 

•  low  level  (2  to  8  transputers), 

•  medium  level  (9  to  32  transputers),  and 

•  high  level  (more  than  32  processors). 

The  medium-size  systems  are  most  popular 
within  the  MOD.  "Mousetrap"  is  a  miniaturized 
version  of  a  supemode,  with  16  transputers  within 
a  package  the  size  of  a  shoebox.  The  mousetrap 
can  be  battery  powered  (less  than  40  watts  is  need¬ 
ed),  and  it  delivers  180  MIPS  and  20-40  MFLOPS. 
Primary  applications  developed  for  Mousetrap 
include  terrain  analysis,  terrain  modeling,  and 
intervisibility  plots.  (An  intervisibiiity  plot  is  a 
plan  view  of  an  area  showing  those  areas  that  are 
visible  from  a  given  observation  point.)  An 
intervisibiiity  plot  can  be  attained  within  five  sec¬ 
onds. 

Archipel,  a  company  in  France,  produces 
commercials  and  movies  developed  on  a  9-transpu¬ 
ter  system  for  image  synthesis  called  VOLVOX. 
This  l^)plication  demonstrates  that  direct  parallel 
programming  is  a  viable  qtproach  in  special-pur¬ 
pose  system  design.  The  architecture  is  tailored  to 
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video  synthesis  applications  and  a  reconfigurable 
network  does  not  have  to  be  formed.  The  fixed- 
topology  architecture  gives  optimum  performance 
at  affordable  price.  The  VOLVOX  system  can 
easily  be  attached  to  a  personal  computer  oi  a 
standard  workstation  such  as  SUN  or  Apollo. 

NUCHEF-H  in  Amsterdam  has  developed  a 
data  acquisition  and  processing  system  called  Zeus 
that  has  more  than  500  transputers  for  high-energy 
physics  experiments.  The  system  is  mainly  used 
for  data  acquisition,  data  compression,  event  selec¬ 
tion,  and  data  monitoring.  The  high-energy  phys¬ 
ics  experiments  aim  at  identifying  elementary 
particles,  determining  their  properties,  and  analyz¬ 
ing  their  interactions.  Zeus  is  also  used  at  HE^, 
an  electron-proton  collider  that  is  located  in  Ham¬ 
burg,  FRG.  Zeus  is  a  special-purpose  architecture 
where  transputers  are  connected  in  various  configu¬ 
rations:  VME  bus,  16-way  crossbar,  and  a  tree. 

To  realize  the  complexity  of  the  entire  system, 
bear  in  mind  that  the  transputer-based  system  is 
only  a  front  end  to  a  processor  built  of  MIPS 
R-3000  microprocessors  that  aiuilyze  the  entire  data 
collected  and  preprocessed  by  the  transputer  sys¬ 
tem.  The  reason  for  choosing  transputers  at  the 
front  end  were: 

•  direct  memory  access  communication 
capability  (on  all  four  links); 

•  simplicity  of  communication  by  using 
transputer  and  communication  chip-sets; 

•  efficient  multitasking  (task  switching  takes 
a  few  microseconds); 

•  acceptable  processing  power  at  low  cost; 
and 

•  support  in  OCCAM  for  parallel  processing, 
interprocess  communication,  and  synchro¬ 
nization,  thereby  eliminating  the  need  for 
an  operating  system. 

All  in  all,  Zeus  is  the  largest  transputer-based 
system  built  to  date  for  physics  triplications. 

The  final  exanqile  (from  Bosch  Company  in 
Stuttgart)  is  object-oriented  simulation  in 
mechatronics  (m^hanical  and  electronic  system). 

A  typical  exanqile  of  a  state-of-the-art  mechatronic 
device  is  an  Anti-blocking  Braking  System  (ABS), 
which  is  a  verv  conqilex  combination  of  sensors, 
microprocessor-based  controllers,  electromagnetic 
actuators,  as  well  as  mechanical  and  hydraulic 


components.  Several  examples  have  demonstrated 
how  hierarchical  decomposition  and  modularity  can 
help  in  model  development  that  is  then  directly 
mapped  onto  a  physical  multitransputer  system. 

This  approach  has  proven  to  be  effective  for  inter¬ 
active  simulations.  The  software  tool  that  was 
developed  for  simulation  support  is  based  on  two 
concepts:  "baseblocks"  and  "structureblocks." 
Baseblocks  correspond  to  object  and  can  be  fully 
characterized  by  using,  for  exarnple,  algebraic  and 
difrerential  equations.  Structureblocks  provide  a 
"glue"  to  connect  the  modules  together  into  one 
coherent  system.  The  8-transputer  system  with  a 
powerful  workstation  front  end  has  been  imple¬ 
mented  at  Bosch  and  serves  as  an  interactive  simu¬ 
lator  for  a  number  of  applications. 

Summary 

An  amazingly  large  spectrum  and  diversity  of 
special-purpose  transputer-based  systems  have  been 
implement^  and  operate  at  European  universities 
and  laboratories.  These  systems  have  workstations 
or  powerful  personal  computers  as  front  ends;  they 
typically  use  up  to  10  transputers  interconnected  in 
customized  fashion,  or  they  may  contain  up  to  100 
connected  in  a  two-dimensional  array. 

The  range  of  applications  is  quite  spectacular; 
pneumatics  and  hydraulics  systems  modeling, 
particle  flow  measurement,  Kalman  filters,  and 
several  real-time  control  systems.  Examples  in¬ 
clude  automatic  flight  control,  vibration  suppres¬ 
sion,  turbine  control  and  robotics,  image  recon¬ 
struction,  radiological  visualization  (for  medicine), 
oceanography  and  meteorology,  real-time  traffic 
monitoring,  spectral  elements  methods  for  compu¬ 
tational  fluid  dynamics,  real-time  image  analysis, 
graph  isomorphism,  textural  analysis,  protein 
structures  prediction,  gate-level  timing  simulation, 
radiation  engineering,  and  product  label  inspection. 

With  the  availability  of  the  T9000  transputer, 
ASIC  extensions,  and  a  wide  range  of  communica¬ 
tion  chips,  the  special-purpose  systems  will  flourish 
and  become  highly  conqwtitive  to  general-purpose 
systems,  esp«:ially  if  they  can  be  incorporated  as 
application  accelerators  on  engineering 
workstations.  This  is  one  of  the  few  areas  in 
which  quantifiable  leadership  can  be  developed  by 
any  group  who  is  ahead  in  a  given  iq)plication 
area. 
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DISTRIBUTED  COMPUTING  SYSTEMS 

High-performance,  flexible,  secure,  reliable, 
distributed  computing  is  becoming  extremely  im¬ 
portant.  With  networks  spanning  the  entire  world 
and  the  downsizing  of  computers  and  increase  in 
their  power,  computing,  printing  and  communica¬ 
tion  resources  will  have  to  be  able  to  respond 
efficiently  to  serve  millions  of  users.  This  means 
that  their  interactions  will  have  to  be  coordinated. 

It  is  estimated  that  by  the  year  2000,  80  percent  of 
business,  education,  and  research  computers  will 
be  linked  by  wide-area  nets,  and  remote  processing 
will  be  as  fast  as  today’s  local  processing.  The 
challenges  are  immense,  and  numerous  projects  are 
under  way.  What  follows  are  examples  of  Europe¬ 
an  efforts  that  are  focused  on  providing  distributed 
computing  with  a  variety  of  much  needed  qualities. 
These  qualities  include  security,  global  naming, 
reliability,  and  real-time  cooperative  work  support. 
Operating  environments  are  highly  diverse,  and  no 
one  has  the  answer  to  all  the  problems  and  chal¬ 
lenges  that  are  facing  designers  and  users  of  these 
systems. 

AMOEBA 

Synopsis 

Amoeba  is  a  distributed  computing  system  that 
was  designed  and  implemented  at  the  Free  Univer¬ 
sity  and  the  Centre  for  Mathematics  and  Computer 
Science  in  Amsterdam.  Amoeba’s  goals  are  to 
develop  next-generation  systems  that  deliver  high 
performance,  high  availability,  parallelism,  and 
reliability  with  the  ease  of  use. 

The  Amoeba  project  has  been  under  way  for 
nearly  12  years  and  has  undergone  several  rede¬ 
signs  and  reimplementations.  The  Amoeba  4.0 
system  was  released  in  1990. The  architecture 
and  software  are  unique,  and  performance  is  im¬ 
pressive,  especially  with  respect  to  remote  proce¬ 
dure  calls  (RPCs). 

The  Amoeba  kernel  is  small  and  simple.  It  is 
easy  to  port  on  different  hardware  platforms.  It 
currently  rum  on  VAX  systems,  M68020,  M6830, 
and  Intel  80386. 


Architecture 

The  Amoeba  hardware  consists  of  four  compo¬ 
nents:  pools  of  processors,  workstations,  special¬ 
ized  servers,  and  gateways.  Workstations  are  used 
for  processes  that  require  intensive  user  interac¬ 
tions;  processor  pools  provide  the  computing  pow¬ 
er  for  applications.  Some  applications  may  execute 
in  parallel  on  the  pools  of  processors.  Software 
allows  dynamic  reconfiguration.  The  number  of 
processors  may  vary  depending  on  the  system  type 
and  availability  of  processors  (busy,  fault-free, 
faulty). 

Specialized  servers  are  machines  that  are 
matched  with  processes  requiring  specific  resources 
(e.g.,  printers,  larger  disks  for  database  applica¬ 
tion).  Gateways  connect  a  number  of  geographi¬ 
cally  distribute  Amoeba  systems  over  wide  area 
networks.  A  prototype  at  Free  University  has  48 
processors  in  its  processor  pool. 

Software 

Amoeba  uses  an  object-oriented  approach  to 
clients  and  servers.  Each  object  is  identified  and 
protected  by  a  capability  that  has  enough  redundan¬ 
cy  in  a  48-bit  check  field  to  assure  cryptographic 
protection.  This  is  in  addition  to  service  port 
number,  object  number,  and  access  rights.  Client 
processes  use  remote  procedure  calls  to  send  re¬ 
quests  to  server  processes  that  implement  and 
manage  objects. 

Amoeba  has  a  unique  fast  file  system  compris¬ 
ing  two  parts:  the  bullet  service  is  a  simple  file 
server  that  stores  files  as  contiguous  byte  strings 
both  on  disk  and  cache  (it  is  faster  than  a  speeding 
bullet,  hence  the  name);  the  directory  service  gives 
symbolic  names  to  capabilities  and  handles  replica¬ 
tion  and  atomicity,  thereby  eliminating  the  need  for 
a  separate  transaction  system.  To  preserve  conti¬ 
nuity  with  existing  systems,  Amoeba  has  a  UNIX 
emulation  facility  consisting  of  a  library  of  UNIX 
system  call  routines. 

Amoeba  has  very  efficient  communication 
capabilities,  especially  the  fast  remote  procedure 
call  (RPC)  provided  by  its  kernel.  It  outperfomis 
SUN  Microsystems  RPC  by  orders  of  magnitude  in 
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terms  of  delay  and  bandwidth.  However,  it  has  a 
limitation  of  handling  only  immutable  files. 

The  communication  is  secure  by  providing  a 
hardware  or  software  "box"  with  48-bit  numbers 
known  only  to  the  server  processes  that  make  up 
the  service  and  to  the  server’s  clients.  A  small 
interface  box  or  a  board  is  placed  between  each 
processor,  and  the  network  has  a  secret  code  set  on 
a  chip.  Cryptographic  algorithms  are  used  in  a 
"software  box"  that  gives  the  same  functional 
effect  as  a  hardware  box  or  a  board.  All  messages 
entering  or  leaving  a  processor  m>!st  be  authenticat¬ 
ed  by  matching  a  header  P  of  the  arriving  messag¬ 
es  with  the  header  P  calculated  by  the  box.  The 
header  is  a  function  of  secret  port  number  G.  P  is 
a  one-way  mapping  of  G  such  that  given  G  it  is 
easy  to  calculate  P  =  fiG),  but  it  is  not  feasible  to 
find  G  if  P  is  given. 

Like  most  systems  today,  the  file  system  in 
Amoeba  is  using  a  hierarchical  directory  structure. 
A  directory  is  a  set  of  name/capability  pairs. 
Directory  operations  on  its  objects  are  simply 
backup,  enter,  and  delete.  The  service  can  be  set 
up  so  that  the  directory  service  asks  whether  to 
niake  n  replicas  of  the  object  that  then  can  be 
distributed  on  different  servers  for  increased  reli¬ 
ability.  The  earlier  mentioned  bullet  service  is  a 
special  type  of  file  server  that  supports  fast  execu¬ 
tion  of  three  basic  file  operations  (read,  create,  and 
delete),  assuming  that  file  contents  cannot  be 
changed. 

Immutable  files  are  easy  to  manipulate  and  the 
replication  mechanism  is  simple  and  efficient. 
Amoeba’s  file  system  is  an  excellent  example  of  a 
tradeoff  between  functionality,  speed,  and  simplici¬ 
ty  of  design  and  implementation. 

The  process  management  in  Amoeba  is  uncon- 
veitional:  remote  execution  is  considered  normal 
while  local  execution  is  viewed  as  the  exception. 
Processes  can  be  remotely  created,  destroyed, 
checiqpointed,  migrated,  and  debugged.  Processes 
can  be  in  two  states;  running  or  stunned  (e.g., 
being  debugged,  attempting  to  communicate, 
faulty). 

14ore  than  150  UNIX  utilities  run  on  Amoeba, 
in  addition  to  the  X  Windows  System,  which  sup¬ 
ports  TCP/IP  and  Amoeba  RPC  communication 
protoct^ls. 


Conclusions 

Amoeba  is  definitely  one  of  Europe’s  most 
advanced  and  competitive  projects  in  the  area  of 
distributed  computing.  In  many  aspects  it  com¬ 
pares  favorably  with  its  most  exciting  competitor  in 
Europe— Chorus,’^  as  well  as  with  Locus, 

Mach,**  the  V-kemel,*^  and  variations  of  UNIX 
systems.  It  is  a  unique  parallel/distributed  comput¬ 
er  system  with  numerous  powerful  and  clever  ideas 
regarding  file  management,  security,  reliability, 
and  communication. 

For  further  information  contact: 

Andrew  S.  Tanenbaun 
Free  University  of  Amsterdam 

or 

Sape  L.  Mullender 
CWI 

P.O.  Box  4079 
1009  AB  Amsterdam 
The  Netherlands 

ADVANCED  NETWORK  SYSTEM 
ARCHITECTURE  (ANSA) 

Synopsis 

ANSA  was  started  as  part  of  a  U.K.  high- 
technology  research  initiative  project  called  Alvey 
in  Cambridge,  U.K.,  in  1984.  It  was  supported  by 
several  major  companies  including  British 
Telecom,  DEC,  Hewlet-Packard,  ICL,  STC,  ITL, 
Racal,  Olivetti,  GEC-Marconi,  and  GEC-Plessey. 
The  project  was  very  successful,  and  it  was  ex¬ 
panded  under  ESPRIT  under  a  project  called  ISA 
with  wider  collaboration.  These  additional  compa¬ 
nies  included  Siemens,  AEG/ ATM,  France 
Telecom,  Swedish  Telecom,  Ericsson  Telecom, 
Chorus  Systems,  Thomson,  Technical  University 
of  Vienna,  and  others.  The  Cambridge  laboratory 
became  APM  Ltd,  in  1989. 

ANSA  was  a  predecessor  of  Open  Software 
Foundation  and  its  goals  continue  to  be  standard¬ 
ization,  technology  transfer,  and  knowledge  dis¬ 
semination.  These  are  combined  with  basic  re¬ 
search  on  foundations  of  distributed  systems,  re¬ 
search  and  development  of  specific  engineering 
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techniques  for  distributed  systems,  training,  con¬ 
sulting,  as  well  as  supply  and  support  of 
ANSA  wars.  ANSA  ware  is  the  entire  spectrum  of 
software  for  building  open  distributed  systems. 
ANSAware  technology  has  been  used  in  the  As¬ 
trophysics  Data  System  at  NASA.  It  is  considered 
to  be  one  of  the  world’s  largest  distributed  process¬ 
ing  systems. 

Software  Architecture 

The  main  goal  of  ANSA  is  to  support  the 
design,  implementation,  operation,  and  evaluation 
of  distributed  systems.  The  emphasis  is  on  evolu¬ 
tion,  not  revolution.  ANSA  developers  believe 
that  with  the  current  level  of  reliance  on  comput¬ 
ers,  the  only  practical  (and  in  fact  feasible)  way  to 
expand  their  systems  will  be  by  an  evolutionary 
process.  ANSA  supports  heterogeneous  environ¬ 
ments  where  application  packages,  operating  sys¬ 
tems,  computers,  disks,  and  network  may  come 
from  different  vendors. 

ANSAware  is  a  suite  of  C  programs  that  can 
be  installed  on  UNIX.  MSDOS.  and  VMS  sys¬ 
tems.  Parts  of  other  operating  systems  [such  as 
UNIX  V.4,  Mach.  Chorus,  SCO-Xenix,  and  Ultrix 
(Digital’s  UNIX)]  are  also  available.  ANSAware 
enables  a  distributed  application  to  be  established 
with  minimal  changes  and  overheads  to  existing 
software.  It  provides  a  uniform  view  of  a 
multivendor  system  and  customizes  user's  applica¬ 
tions.  ANSAware  adheres  to  the  most  widely 
established  standards.  The  support  for  parallel 
processing  and  synchronization  is  provided.  ANS¬ 
Aware  also  facilitates  an  object  based  approach. 
ANSAware  encapsulates  a  program  and  its  data. 
APM  Ltd  has  a  number  of  cooperative  agreements 
with  companies  which  provide  specific  applica¬ 
tions.  Example  include: 

1.  Astrophysics  Data  System  (ADS),  which 
provides  scientists  with  the  ability  to  access 
and  manipulate  data  from  U.S.  and  Euro¬ 
pean  space  instruments,  including  remote 
databases.  It  uses  ANSAware  with  Ellery 
Systems  (Boulder.  Colorado)  software. 

2.  STC  Technology  Ltd.  from  London  has 
developed  a  number  of  object-oriented 


applications,  including  cooperative  work 
(Multiworks)  and  banking  (Bank  92). 

3.  ANSAware  on  top  of  Chorus/Mix,  an 
operating  system  developed  by  Chorus 
Systemes  (France),  accelerates  distributed 
applications  by  20  to  70  percent.  The  key 
to  this  performance  enhancement  lies  in 
lightweight  processes  from  Chorus  and 
optimized  interprocess  communication. 

Conclusions 

A  quote  from  APM  Ltd’s  advertizing  brochure 
says  it  all:  "We  looked  at  the  research  being  under¬ 
taken  worldwide.  Harvested  it,  integrated  it,  and 
made  it  come  together  into  a  coherent  and  consis¬ 
tent  architecture. "  There  is  a  wealth  of  concepts 
and  solid  foundations  implemented  in  ANSAware, 
which  will  benefit  both  the  research  and  the  devel¬ 
opment  community  in  the  opi^n  distributed  comput¬ 
ing  environment. 

DELTA4 

Synopsis 

DELTA-4  (Definition  and  Design  of  open 
Dependable  Distributed  System  architecture)  was  a 
concerted  effort  of  academic,  research,  and  indus¬ 
trial  communities  under  the  auspices  of  ESPRIT- 1 
and  2  (1986-1991)  to  design  and  implement  a 
highly  reliable  distributed  computing  system.  It 
had  a  budget  of  about  $35M,  and  the  working 
.system  was  demonstrated  in  late  1991.  The  system 
was  developed  at  LAAS-CNRS  (Toulouse,  France) 
with  the  participation  of  University  of  Newcastle 
(U.K.),  LEI  (France).  lEI  (Italy),  MARI  (U.K.), 
IITB  (FRG),  INESC  (Portugal),  Ferranti  (U.K.), 
Bull  (France),  and  SEMA  (France).  Credit 
Agricole  (France)  with  Renault  (France)  got  in¬ 
volved  as  system  users.  The  wide  range  of  objec¬ 
tives  and  requirements  included  distribution,  real 
time,  high  performance,  heterogeneity,  openness, 
application  software,  communications,  portability, 
and  dependability. 

This  extremely  ambitious  project  resulted  in 
several  interesting  concept  whose  details  are  de¬ 
scribed  in  the  next  section. 
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Architecture  and  Software 

The  generic  DELTA-4’s  architecture  compris¬ 
es  computation  nodes  and  a  message-passing  com¬ 
munication  network.  Nodes  are  off-the-shelf  ma¬ 
chines  that  require  minimum  redesign  for  technolo¬ 
gy  updates,  and  the  network  is  standard  ethemet. 
The  major  emphasis  is  on  dependability,  with  focus 
on  high  reliability  (continuity  of  delivered  service) 
and  security  (preservation  of  confidentiality  and 
integrity).  Fault  classes  tolerated  by  the  system 
include: 

1.  Hardware  faults  (accidental,  internal,  phys¬ 
ical)  which  are  tolerated  by  distribution, 
including  file  scattering. 

2.  Intrusion  (deliberate,  external,  human) 
which  are  handled  by  fragmentation  and 
scattering. 

3.  Software  faults  (accidental,  internal,  hu¬ 
man)  are  tolerated  by  diversified  computa¬ 
tion. 

Main  approaches  to  fault  tolerance  include; 

•  application-specific  fault  tolerance; 

•  stable  storage  mechanisms  (periodical 
dump,  transaction  mechanism  with  replica¬ 
tion  and  access  of  storage  from  multiple 
sites);  and 

•  replicated  processing  (lots  of  thinking  went 
into  replica  determinism  to  ensure  that  the 
outcome  of  replicas  is  the  same  as  that  of 
the  original). 

Units  of  fault  tolerance  are  individual  nodes  of 
a  distributed  system;  units  of  replication  are  soft¬ 
ware  components  on  distinct  nodes.  In  addition, 
an  application  can  be  executed  in  a  distributed 
mode  on  several  nodes  by  using  message-passing 
communication  and  distributed  memory.  Replicat¬ 
ed  software  components  are  capable  of  masking 
errors  caused  by  faulty  nodes.  Elimination  of 
errors  (error  processing,  "symptom  relief")  is 
accomplished  by  standard  error  detection/recovery 
or  error-compensation.  Error  reporting  triggers 
fault  treatment  (elimination  of  faults,  "cure  of 
illness").  This  involves  fault  diagnosis  and  recon¬ 


figuration  or  repair.  One  method  of  reconfigura¬ 
tion  for  software  components  is  achieved  by  clon¬ 
ing  new  replicas. 

Each  node  of  the  DELTA-4  system  has  a 
self-checking  Network  Attachment  Controller 
(NAC)  that  is  fail-silent  in  addition  to  a  standard 
workstation  that  is  usually  fail-uncontrolled.  Fail- 
silent  implies  perfect  self-check  and  no  value- 
errors.  NAC  serves  as  a  guard  against  data  incon¬ 
sistency  and  a  network  interface.  Atomic  multicast 
ensures  input  consistency  for  active  replicas  with 
competitive  or  round-robin  output  validation.  For 
passive  replicas,  primary  and  back-up  processing 
nodes  are  used  with  transaction,  systematic,  or 
periodic  checkpoints.  Semiactive  replicas  involve 
leader  and  follower  (shadow  processing).  Novelty 
here  is  that  the  leader  executes  both  deterministic 
and  nondeterministic  processing  and  in  the  absence 
of  faults  does  not  send  output  messages.  The 
leader  also  instructs  the  follower  regarding 
nondeterministic  decisions. 

There  are  two  basic  variants  of  DELTA-4 
architecture; 

1 .  D4-OSA:  Open  System  Architecture, 
which  adheres  to  open  system  standards. 

2.  D4-XPA:  Extra  Performance  Architecture, 
which  emphasizes  performance  and  real 
time.  It  is  restricted  to  homogeneous 
nodes  and  provides  explicit  support  for 
local  executives  and  communication  to 
ensure  timeliness. 

The  multipoint  communication  system  supports 
all  seven  layers  of  protocols  in  dependable  manner 
by  using  replication  of  sources  and  destinations  as 
well  as  multicasting. 

The  unique  features  of  system  administration 
are  multicast  services,  built-in  processing  of  com¬ 
putational  errors,  and  management  of  both  commu¬ 
nication  and  computation  resources  (in  addition  to 
standard  network  management  functions).  Three 
basic  management  tasks  on  DELTA-4  are; 

•  planning  and  integration  of  distribution  and 
redundancy, 

•  control  and  monitoring  of  system  behavior, 
and 

•  maintenance  support  and  fault  treatment. 
Both  static  and  dynamic  mechanisms  are  used. 
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To  validate  the  design,  protocols  were  verified 
and  dependability  was  evaluated  by  using  Markov 
modeling  techniques.  Extensive  fault  injection 
experiments  were  conducted  and  software  reliabili¬ 
ty  was  evaluated. 

Conclusions 

DELTA-4  is  one  of  Europe’s  most  significant 
entries  in  dependable  distributed  computing.  A 
number  of  concepts  have  been  proposed  and  imple¬ 
mented.  Pile  scattering,  numerous  replication 
methods,  extensive  fault  injection  techniques,  and 
software  reliability  evaluation  (especially  reliability 
growth)  are  already  tangible  accomplishments  of 
this  project.  Also,  demonstrating  that  dependable 
systems  can  be  built  from  not-so-dependable 
off-the-shelf  components  is  important. 

Installations  of  DELTA-4  systems  at  industrial 
sites  should  prove  the  usefulness  of  these  concepts 
in  engineering  practice  and  will  give  excellent 
experimental  data  about  specific  ideas  that  should 
be  further  pursued  and  disseminated. 

For  further  information  contact: 

David  Powell 
LAAS-CNRS 

7  Avenue  du  colonel  Roche 
31077  Toulouse  Cedex,  France 

REX 

Synopsis 

The  Reconfigurable  and  Extensible  Parallel 
and  Distributed  Systems  (REX)  project  is  part  of 
the  ESPRIT  program.  It  has  a  ^ndamental  goal  of 
designing  an  integrated  methodology  and  associated 
tools  for  the  development  and  management  of 
parallel  and  distributed  systems.  The  project  began 
in  May  1989  and  is  to  run  for  five  years.  Special 
attention  is  being  given  to  exploring  the  potential 
for  system  reconflguration  and  extension. 

The  project  is  headed  by  Imperial  College  in 
London  (U.K.),  with  the  participation  of  academics 
from  Technical  University  of  Berlin  (FRG),  GMD- 
Karlsruhe  (FRG),  Programming  Research  Group  of 
Oxford  University  (U.K.),  and  imlustrial  partners 
from  Stollman  GmbH  (FRG),  Siemens  AG  (FRG), 


GSI  Tecsi  (Paris,  France),  Intercom  S.A.  (Greece), 
and  2i  Industrial  Informatics  GmbH  (FRG). 

REX  applications  are  focused  on  industrial 
automation  and  telecommunications.  The  approach 
is  based  on  configurations  of  modular  software  and 
hardware  components  as  the  framework  for  specifi¬ 
cation,  analysis  and  evaluation,  methodology, 
programming  methods,  dynamic  configuration  and 
reconfiguration,  runtime  support,  and  monitoring 
for  distributed  and  parallel  computing  systems. 

This  project  tackles  some  core  problems  that 
are  vital  in  the  design  and  implementation  of  paral¬ 
lel  and  distributed  computing  systems. 

Architecture  and  Software 

The  REX  project  aims  at  developing  an  inte¬ 
grated  design  methodology  and  associated  support 
tools  for  the  development  and  management  of 
parallel  and  distributed  systems;  its  main  emphasis 
is  on  configuration  structure.  Software  systems  are 
described  and  constructed  as  configurations  of 
interacting  software  components  executed  on  mod¬ 
ular  hardware  components. 

This  configurability-based  approach  is  expected 
to  result  in  flexible,  clear,  and  modular  designs. 
The  structural  view  will  also  be  useful  in  facilitat¬ 
ing  change  and  evolution  of  a  distributed  comput¬ 
ing  system  by  dynamic  configuration.  This  ap¬ 
proach  may  also  help  to  develop  easily  comprehen¬ 
sible  and  maintainable  object-based  designs. 

The  multiparadigm  approach  focuses  on  multi¬ 
ple  specification  methods,  a  variety  of  analysis 
techniques,  multiple  programming  languages  as 
well  as  their  integration.  Both  formal  and  informal 
techniques  will  be  used  to  support  top-down  and/or 
hottom-up  design.  The  REX  Engineering  Environ¬ 
ment  will  facilitate  novel  development  techniques 
and  system  evolution. 

REX  views  paraliel/distributed  systems  as  sets 
of  components  (objects)  that  encapsulate  state  and 
have  well-defined  interfaces.  The  interfaces  are 
specified  by  an  Interface  Specification  Language 
(ISL).  Components  can  be  replicated,  and  their 
instances  may  execute  in  parallel.  The  sets  of 
instances  with  their  interconnection  are  described 
by  a  special  configuration  language  called  Darwin. 
Darwin  supports  both  arbitrary  and  hierarchical 
topologies  1^  allows  composite  components. 
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Dynamic  structural  changes,  system  extensions, 
and  modifications  can  also  be  defined  by  Darwin. 

Component  programming  supports  communi¬ 
cation  primitives  (integrated  with  C/C++  and 
Modula  2  languages,  compilers).  UNIX  hosts 
runtime  support  including  M6800  and  transputers; 
Object  Selection  Language  (OSL)  facilitates  generic 
configuration  change. 

Specific  tools,  languages,  and  protocols  have 
been  developed  (or  will  be  developed)  to  support  a 
wide  spectrum  of  specification  methods,  techniques 
for  system  development,  integration,  analysis, 
evaluation,  and  design. 

System  Specification 

A  number  of  methodologies  has  been  devel¬ 
oped  to  describe  the  structure  and  behavior  of 
parailel/distributed  systems. 

Wanda  is  a  specification  language  for  describ¬ 
ing  parallel,  time-critical  systems  whose  formal 
framework  is  based  on  timed-CSP  (timed  Commu¬ 
nicating  Sequential  Processes). 

Specification  and  Description  Language  (SDL), 
standardized  by  the  CCITT,  is  a  language  used  for 
specifying  communication  protocols.  SDL  process 
diagrams  can  be  modeled  with  REX  tools. 

SPECnets  are  a  variation  of  high-level  Petri 
nets  based  on  first-order  predicate  logic;  they  can 
be  used  for  system  modeling  to  specify  and  verify 
certain  system  properties. 

Timed-CSP  is  a  language  for  specifying  and 
reasoning  about  liveness,  deadlock,  and  timing. 

Methods  for  System  Development 

REX  aids  in  the  systematic  development  of 
software  for  distributed  computing  systems,  with 
emphasis  on  the  evolution  by  stepwise  refinement 
and  configurability.  Two  approaches  have  been 
developed;  a  formal  one  that  uses  Wanda,  and  an 
informal  one,  called  Constructive  Design  Approach 
(CDA),  whose  description  follows. 

Constructive  Design  Approach  is  based  on  two 
principles; 

•  explicit  system  structures,  and 

•  context-independent  components. 


Starting  with  a  real-world  application,  the  main 
processing  components  and  data  flow  among  them 
are  identified.  This  leads  to  structural  description, 
a  graph  that  connects  processes  with  component 
types,  and  interface  specification.  Finally,  compo¬ 
nents  can  be  further  decomposed  or  described  in 
detail  in  a  programming  language. 

Rex  Designer  is  a  design  tool  that  supports 
CDA.  It  provides  an  environment  in  which  soft¬ 
ware  structures  can  easily  be  captured,  edited, 
modified,  and  displayed  in  various  forms  on  a 
multiwindow  system.  The  tool  also  facilitates  the 
automatic  translation  of  a  design  into  a  Darwin 
configuration  program,  which  can  then  be  com¬ 
piled.  An  additional  tool,  called  REXDraw,  sim¬ 
plifies  manipulation  of  a  design  diagram  and  allows 
monitoring  and  visualization  of  running  REX  pro¬ 
grams  in  a  network  of  workstations. 

Int^ration  Frameworks 

Viewpoint  can  focus  on  a  specific  aspect  of  a 
system  design  and  display  a  specific  knowledge 
about  this  aspect.  Viev/Point  templates  may  run 
through  the  entire  hierarchy  of  the  design  and  its 
various  aspects,  therefore  they  facilitate  system 
integration. 

TELan  is  a  language  for  uniformly  describing 
the  structure  of  an  application  system. 

Analysis  and  Evaluation 

The  system  behavior  and  performance  can  be 
analyzed  and  validated  by  a  set  of  tools  or  by  the 
use  of  formal  techniques  such  as  specification 
language  Wanda.  The  REX  Analysis  Toolset 
comprises  OOPS,  ddraw,  ViSiT,  and  The  Visualiz- 
er. 

OOPS  (Object-Oriented  Petri  Net  Simulator)  is 
a  discrete-event  Monte-Carlo  simulator  that  has  an 
extended,  timed  Petri  nets  model  with  hierarchy 
and  predicate/transition  mechanisms,  ddraw  is  a 
graphical  tool  that  produces  descriptions  of  the 
Petri  nets;  ViSiT  is  a  graphical  tool  for  preparing 
and  displaying  the  analysis  results  delivered  by 
ddraw.  The  Visualizer  is  used  for  graphical  repre¬ 
sentation  of  extracted  analysis  data.  DISYS  (Dis¬ 
tributed  SYstem  Simulator)  is  a  tool  for  simulating 
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specifications  of  distributed  systems.  It  can  pro¬ 
cess  descriptions  in  a  form  of  parameterized  dia¬ 
grams,  Petri  nets,  queuing  nets,  or  extended  timed- 
-Petri  nets.  There  is  also  an  additional  tool  that 
can  transform  SDL  (Standard  Design  Language) 
specifications  into  the  DISYS  model. 

The  feasibility  of  the  REX  approach  has  been 
demonstrated  for  conq)uter-integrated  manufactur¬ 
ing  (a  small  repair  shop)  and  telecommunication  (a 
network  of  telephone  exchanges).  Extension  of  the 
latter  project  aims  at  prototyping  a  full  ISDN 
private  exchange. 

Conclusions 

REX  is  one  of  the  most  inq>ortant  projects  in  a 
quickly  emerging  area  of  configurable  distributed 
systems.  The  REX  approach  is  based  on  the  fol¬ 
lowing  principles: 

•  separate  configuration  language  for  struc¬ 
tural  description, 

•  context-independent  component  types, 

•  complex  components  composed  of  instanc¬ 
es  of  component  types,  and 

•  dynamic  evolution  as  a  change  to  the  com¬ 
ponent  instances  and/or  their  interconnec¬ 
tion. 

A  vast  spectrum  of  tools  has  been  developed  to 
support  design,  specification,  analysis,  program¬ 
ming,  run-time  support,  and  system  integration  in  a 
distributed  system  environment.  Only  time  will 
show  how  in:q)ortant  these  developments  are  in 
harnessing,  controlling,  and  configuring  distributed 
conq>uting  systems  that  grow  at  rampant  pace.  It 
is  also  crucial  that  full-scale  demonstrations  be 
developed  to  show  the  power  and  potential  of  the 
REX  system. 

For  fiirther  information  on  REX  contact: 

Jeff  Kramer 

Dqiartment  of  Conqiuting 
Im^rial  College 
180  (Queens  Gate 
London  SW7  2BZ 
United  Kingdom 


4.  SOFTWARE 
INTRODUCTION 

Europe’s  great  potential  is  in  software  technol¬ 
ogy'.  An  overview  of  parallel  languages  is  given  ; 
it  is  abstracted  from  an  excellent  report  by  R.H. 
Perrot,**  with  my  additiorud  comments.  Two 
European  operating  systems  [Chorus  (commercial) 
and  PEACE  (experimental)]  are  then  described. 
Chorus  appears  to  be  a  strong  competitor  to  UNIX 
and  Mach.  The  final  part  of  this  section  describes 
the  European  Software  Systems  Initiative  (ESSI), 
whose  goals  focus  on  providing  a  solid  infrastruc¬ 
ture  for  quality  software  development,  good  train¬ 
ing  base  for  software  engineers,  standards,  and 
reuse  library. 

PARALLEL  LANGUAGES 
Overview 

With  a  formidable  tradition  in  sequential  lan¬ 
guages,  Europe  enters  parallel  languages  research 
from  a  position  of  strength  and  parity  with  devel¬ 
opments  elsewhere.  At  the  core  are  parallel  systems 
models  (see  section  on  Basic  Definitions,  Concepts, 
Taxonomies).  For  example,  programming  on 
SIMD  machines  seems  to  be  simpler  than  on 
MIMD  computers  as  is  operating  in  a  global  ad¬ 
dress  space  instead  of  local.  The  models  may 
affect  many  facets  of  parallel  systems:  applica¬ 
tions,  languages,  operating  systems,  and  architec¬ 
tures. 

The  variety  of  computational  models  has  re¬ 
sulted  in  parallel  software  diversity.  A  parallel 
langi;.age  family  can  be  divided  into  declarative  or 
imperative  languages  Declarative  languages  can 
be  subdivided  into  functional  and  logical;  impera¬ 
tive  ones  consist  of  procedural  and  object-oriented 
languages. 

This  diversity  in  languages  arises  because  there 
is  a  tradeoff  between  programmability  and  efficien¬ 
cy.  It  has  resulted  in  two  fundament  program¬ 
ming  methodologies:  implicit  and  explicit.  Im¬ 
plicit  methods  offer  ease  of  code  development  but 
inefficient  program  execution;  explicit  techniques 
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in^)rove  execution  efficiency  but  make  program¬ 
ming  more  difficult.  Other  issues  include  readabil¬ 
ity,  expressiveness,  and  reliability  (coirectness). 

The  three  approaches  to  parallel  programming 
are: 

«  Using  a  sequential  code  ("dusty  deck")  and 
relying  on  a  "parallelizer"  (a  special  com¬ 
piler  that  detects  parallelism  and  modifies 
the  code  such  that  it  can  be  executed  in 
parallel).  This  is  a  parallel  systems  users 
dream,  but  at  this  time  this  "portability" 
from  sequential  to  parallel  computers  is 
mainly  for  DO  loops.  A  lot  of  work  is 
being  done  world-wide  in  this  area,  but  a 
more  concerted  research  is  badly  needed. 
Significant  progress  has  been  made  in 
semiautomatic  (with  programmer’s  partici¬ 
pation)  parallelization.  One  of  the  most 
interesting  parallelizing  compilers  is 
ParaSoft,  which  was  developed  at  CalTech 
and  works  best  on  transputer-based  sys¬ 
tems. 

•  Extending  an  existing  sequential  language 
to  represent  parallel  constructs.  Here 
again,  a  substantial  research  effort  is  in 
progress,  but  so  far  rather  limited  success 
has  been  noted.  There  are  parallel  For¬ 
tran,  high-performance  Fortran,  Fortran 
90,  parallel  Pascal,  and  other  extensions, 
but  they  frequently  are  architecture-  or 
model-dependent,  and  debugging  seems  to 
be  difficult. 

•  Developing  a  new  parallel  language.  This 
is  a  radical  approach,  and  several  efforts 
are  under  way.  With  frequent  changes  in 
models  and  architectures,  it  is  veiy'  difficult 
to  convince  users  to  ntake  substantial  in¬ 
vestments  in  iioftware  as  both  langtiages 
and  architectures  are  evolving.  Serious 
users  will  not  take  this  route  unless  a  new 
language  is  accepted  as  a  standard. 

"Dusty  Deck"  Approach 

This  method  focused  on  nested  DO  loops 
interpretation  and  interprocedutal  analysis  (the 
tracing  of  data  across  procedure  calls).  Most  of 


the  research  in  this  area  has  been  on  shared-memo- 
ly  models,  but  progress  on  distributed  and  hybrid 
memory  systems  is  imminent.  Although  most  of 
significant  work  on  this  topic  has  been  done  in  the 
U.S.  (Rice  University,  IBM,  University  of  Illinois, 
CalTech,  and  Cray  Research),  a  number  of  signifi¬ 
cant  projects  in  Europe  demonstrate  genuine  prog¬ 
ress  in  semiautomatic  code  parallelization. 

•  Suprenum  Parallelizer  Bonn  (SUPERB), 
developed  by  Zima  et  al.'^  is  a  semiauto¬ 
matic,  interactive  SIMD/MIMD  code 
parallelizer  of  Fortran  77.  SUPERB  takes 
Fortran  77  code  as  an  input  and  produces 
an  equivalent  Suprenum  Fortran  program 
at  the  output.  This  is  one  of  the  most 
advanced  Fortran  parallelizing  computers; 
it  also  handles  array  manipulations  as  in 
Fortran  90.  It  has  been  implemented  on 
the  Suprenum  parallel  computer. 

•  Velour  .s  a  fully  automatic  vectoriser, 
designed  and  developed  by  Dehbonei  and 
Memmi  at  the  Bull  Research  Center  in 
France.^®  It  is  currently  being  extended  to 
produce  an  automatic  parallelizer  for  the 
ESPRIT  Euro  Work  Station  (EWS).  EWS 
is  a  high-performance  workstation  with  a 
parallel  system  accelerator. 

•  Supemode  2  Parallelizing  Compiler  is 
another  ESPRIT  project  whose  goal  is  to 
design  and  implement  a  parallelising  For¬ 
tran  90  compiler  for  transputer-based 
parallel  computers.  It  is  marketed  by 
Parsys  (U.K.)  and  Telmat  (France). 

Extension-based  Approach 

This  approach  has  evolved  in  two  directions: 

•  array  and  vector  processing  (SIMD-like 
processing);  and 

•  multiprocessor/distributed  computing 
(MIMD-like  processing) 

This  class  of  languages  provides  explicit  extensionr 
for  supporting  parallelism  and  allows  programmers 
to  express  parallel  data  and  manipulate  insttuctions. 
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Let  u$  concentrate  first  on  array  and  vector 
processing  languages  where  the  goal  is  to  broadcast 
a  single  instruction  and  apply  it  simultaneously  to 
an  array  or  a  vector  of  data. 

This  data  parallelism  has  been  successfully 
applied  in  a  number  of  array  processors.  Exam¬ 
ples  include  Illiac  IV,*‘  Massively  Parallel  Proces¬ 
sor,  Connection  Machine, and  (in  Europe)  the 
Distributed  Array  Processor.^^  Most  of  the  lan¬ 
guages  for  these  machines  were  extensions  to 
Fortran  with  some  exceptions  such  as  Actus^^  and 
Booster.^® 

The  main  European  extension-based  languages 
for  SIMD  processing  are  briefly  described  here. 

•  Fortran  Plus  is  the  main  programming 
language  for  the  Distributed  Array  Proces¬ 
sor.  An  easy-to-learn  extension  to  Fortran, 
it  features  automatic  folding  (i.e.,  a  user 
can  declare  an  arbitrary  size,  vector,  or 
matrix  that  will  be  automatically  mapped 
into  a  processor  array)  and  a  flexible  meth¬ 
od  of  manipulating  arbitrary  array  elements 
or  subarrays. 

•  Actus  concentrates  on  the  syntactic  and 
semantic  rules  that  would  support  parallel¬ 
ism.  Actus  provides  a  user  with  the  ability 
to  explicitly  specify  the  degree  or  extent  of 
parallelism  in  both  the  data  and  the  instruc¬ 
tions.  It  supports  user-constructed  data 
types  and  program  structures,  on-line 
control  of  parallelism,  and  independent 
indexing. 

•  Booster  is  a  high-level  language  for  ex¬ 
pressing  array  and  vector  parallelism  at  the 
algorithm  level.  Programs  written  in 
Booster  can  be  embedded  in  a  traditional 
programming  environment,  and  code  can 
be  generated  in  a  number  of  classical  im¬ 
perative  languages.  Two  fundamental 
constructs  of  the  language  are  shapes  and 
views.  Shape  is  the  only  available  array¬ 
like  data  structure  with  more  diversified 
indexing  than  an  ordinary  array,  and  the 
view  is  a  reference  to  shape  elements. 

This  approach  frees  a  programmer  from 
defining  loops  and  index  variables. 


•  Fortran  90  is  a  successor  to  Fortran  77  and 
will  very  likely  become  one  of  the  most 
important  languages  for  array/vector  pro¬ 
cessing  in  the  1990s.  It  has  been  accepted 
as  a  standard  after  more  than  10  years  of 
deliberations  and  is  being  adopted  by  the 
ever-increasing  number  of  parallel  systems 
developers.  It  draws  on  past  experience 
with  Actus,  Booster,  and  Fortran  Plus,  and 
seems  to  be  gaining  acceptance  by  the 
scientific  computing  community. 

•  Eva  is  an  explicit  vector  language  devel¬ 
oped  at  the  University  of  Lille,  France.^® 
Eva  allows  explicit  vector  handling  and 
sharing  of  Fortran  data  and  C  programs. 

•  Hellena  is  a  vector  processing  language 
conceived  at  the  University  of  Rennes.^^ 


NEW  PARALLEL  LANGUAGES 
Imperative  Languages 

Explicit  Process  Control  Approach 

The  arena  of  multiprocessor/distribuied  sys¬ 
tems  has  been  most  active  since  the  early  1960s. 
During  this  time,  programmers  realized  that  they 
could  handle  and  control  processes  [process  (task)] 
to  maximize  resource  utilization.  This  concept  is 
especially  attractive  in  parallel  and  distributed 
computing  environments. 

Concepts  such  as  conditional  synchronization, 
mutual  exclusion,  and  monitor  followed.  These 
resulted  in  a  series  of  techniques  that  produced 
solutions  to  several  problems  originating  from 
these  concepts. 

A  monitor^®  defines  a  shared  data  structure  and 
the  operations  (procedures  or  functions)  that  cati  be 
performed  on  it;  mutual  exclusion  and  conditional 
synchronization  concepts  have  been  incorporated. 
The  monitor  approach  is  used  in,  for  example. 
Concurrent,  Pascal,^’  Modula,®®  and  Pascal  Plus.®* 
It  is  especially  well-suited  for  shared  memory 
architectures. 

Another  approach  for  coping  with  the  mutual 
exclusion  and  conditional  synchronization  is  based 
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on  the  message-passing  concept  This  is  a  depar¬ 
ture  from  the  traditional  hierarchical  relationship 
among  programs  or  routines  and  provides  for 
symmetric  and  asymmetric  relations  at  the  same 
level.  A  concept  of  guarded  command^^  was  later 
used  by  Hoare^^  for  communicating  sequential  pro¬ 
cesses  (CSP)  where  symmetrical  relationships  exist 
between  communicating  processes.  The  asymmet¬ 
ric  relationship  between  communicating  processes 
has  been  introduced  in  distributed  processes  (DP)^^ 
whereby  the  called  process  does  not  need  to  know 
the  name  of  the  process  that  called  it. 

Many  of  these  concepts  formed  the  basis  for 
languages  such  as  Ada^^  and  Occam. Ada  was 
developed  in  France  for  the  U.S.  Department  of 
Defense;  Occam  was  developed  in  the  U.K.  and 
was  adopted  by  Inmos  as  a  design  language  and  a 
program  development  language  for  parallel  com¬ 
puters. 

Major  language  development  projects  in  Eu¬ 
rope  include: 

•  Full  OCCAM  is  a  successor  to  original 
OCCAM.  It  was  developed  as  part  of 
ESPRIT’S  PUMA  (Parallel  Universal 
Machine  Architecture)  prcject  and  is 
geared  toward  the  T9000  Inmos  transputer. 

•  Ada  9X  is  a  current  effort  in  updating  the 
Ada  standard.  Although  led  by  U.S.  scien¬ 
tists,  it  is  heavily  influenced  by  European 
contributions. 

•  SCRIPTS  is  a  language  that  exploits  other 
model  of  concurrent  and  distributed  com¬ 
puting.  It  is  being  developed  at  CNRS, 
Besancon,  France,  as  part  of  the  ESPRIT 
high-performance  computing  initiative. 

•  Modula-3  (University  of  Karlsruhe,  FRG) 
is  an  extension  to  Moduia-2  and  has  syn¬ 
chronous  and  asynchronous  features  plus 
declarations  for  mapping  array  data  onto 
processors.  It  is  suitable  for  both  SIMD 
and  MIMD  architectures. 

•  Pascal  Plus  (The  (^een’s  University  at 
Belfast)  is  a  modular  multiprocessing  lan¬ 
guage.  It  supports  data  abstraction  as 
defined  by  a  user.  It  also  incorporates 


general  processes  for  parallelism  identifica¬ 
tion  and  monitors  for  synchronization  and 
mutual  exclusion  identification. 

•  Linda  (University  of  Aalborg,  Denmark)  is 
a  follow-up  on  the  U.S.  effort^*  in  parallel 
programming  for  distributed  computing. 
The  objective  is  to  maximize  utilization  of 
processors  in  parallel/distributed  computing 
systems  that  have  message-passing  archi¬ 
tectures. 

•  3L  (Lattice  Logic  Limited,  U.K.)  extends 
Fortran  to  support  task  definition  and 
interprocess  communication. 

Object-oriented  Languages 

Object-oriented  programming  is  considered  to 
be  the  most  promising  imperative  language  ap¬ 
proach  for  the  future.  An  object  is  used  to  repre¬ 
sent  both  data  and  the  means  of  manipulating  that 
data.  Objects  are  expected  to  simplify  program¬ 
ming,  increase  reliability,  improve  security,  and 
ease  structuring  and  integration  of  programs. 

Simula,  which  was  developed  in  Europe,  can 
be  viewed  as  a  predecessor  in  terms  of  structure  of 
the  object-oriented  approach.  Smalltalk,  Objective 

and  C-l-  are  considered  to  be  typical 
representatives  of  this  family  of  languages.  They 
have  been  designed  for  strictly  sequential, 
uniprocessor  systems. 

Extensions  to  parallel  programming; 

•  require  that  more  than  one  object  be  active 
(it  is  considered  to  be  a  process); 

•  allow  simultaneous  sending  and  receiving 
of  messages  on  different  objects; 

•  permit  messages  to  be  broadcast  to  several 
objects;  and 

•  permit  an  object  to  be  active  without  being 
received  as  a  message. 

Major  European  efforts  in  this  areas  include; 

•  POOL-2  (Parallel  Obj«;t-oriented  Lan¬ 
guage)^*  was  develop^  at  niilips  (The 
Netherlands)  for  prograinming  symbolic 
{q>plications.  The  applications  can  be 
processed  on  DOOM  (Decentralized 
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Object-Oriented  Machine),  a 
100-processor,  multistage  network-based 
parallel  computer POOL-2  allows 
strong  typing  and  synchronous  and 
asynchronous  communication  based  on 
message  passing. 

•  PO  (Parallel  Objects)  is  an  object-oriented 
language  developed  at  the  University  of 
Bologna  (Italy)  for  Meiko  transputer-based 
parallel  platforms.  It  offers  flexible  com¬ 
munication  modes  and  dynamic  creation  of 
objects. 

•  OBLOG  (Object-Oriented  Logic)  is  an 
object-oriented  and  concurrent  approach  to 
systems  specification  that  uses  textual  and 
graphical  constructs.  It  was  developed  at 
the  University  of  Aveiro  (Portugal). 

DECLARATIVE  LANGUAGES 

Declarative  languages  use  a  notation  to  repre¬ 
sent  what  is  to  be  calculated  rather  than  how  it  is 
supposed  to  be  calculated.  There  are  two  main 
models: 

•  the  logic  model,  which  is  based  on  the 
relation  between  entries,  and 

•  the  functional  model,  which  is  based  on  the 
use  of  functions. 

A  key  advantage  of  declarative  languages  is  that 
they  have  a  sound  mathematical  basis;  therefore 
diey  have  the  ability  to  prove  a  program  correct¬ 
ors.  This  property  is  vital  in  the  world  of  ever- 
increasing  computer  use  and  reliance  on  them. 

Logic  Languages 

Logic  languages  are  based  on  formalized  meth¬ 
ods  of  reasoning  that  use  inferences  and  deduc¬ 
tions.  This  approach  concentrates  on  proving 
entities  radier  than  on  construring  entities,  as  in 
imperative  languages. 

Europeans  pioneered  effort  in  this  area  by 
develq>ing  a  language  called  Prolog  (PROgram- 
ming  L(X}ic);  it  was  devel(:^>ed  by  Colmerauer  and 
his  colleagues  at  the  University  of  Marseilles  in 
early  1970s.  Prolog  is  widely  used,  mainly  in 


Europe  and  Japan,  mostly  in  the  area  of  artificial 
intelligence.  Extension  to  parallelism  include 
AND-parallelism,  OR-parallelism,  and  parallel 
pattern  matching. 

In  Europe  substantial  efforts  are  being  made  to 
create  parallel  logic  languages.  They  include: 

•  Parlog  (Parallel  Logic)  is  a  concurrent 
programming  language  based  on  Prolog 
and  AND-parallelism.  It  was  developed  at 
Imperial  College,  London.'*^^ 

•  Strand  is  a  parallel  Prolog-like  language 
whose  commercial  version  is  marketed  by 
Strand  Software  Technologies,  U.K.*” 

•  PEPMA  is  parallel,  Prolog-like  language 
ESPRIT  project  for  shared  memory  ma¬ 
chines. 

•  PALAVDA  is  an  ESPRIT  project  that 
focuses  on  Parallel  Architectures  and  Lan¬ 
guages  for  Advanced  Information  Process¬ 
ing.  PALAVDA  aims  at  analyzing  differ¬ 
ent  architectures  to  develop  parallel  archi¬ 
tectures  for  efficient  execution  of  logic, 
functional,  and  object-oriented  languages. 

•  PADMAVATI  (Parallel  Associative  Devel¬ 
opment  Machine  As  a  Vehicle  for  Artificial 
Intelligence)  has  among  other  goals 
parallelization  of  Prolog,  procedures  devel¬ 
opment  for  handling  ports  and  message 
passing,  and  the  use  of  AND-parallelism. 

•  COMPULOG  (Computational  Logic)  is  an 
ESPRIT  project  that  concentrates  on  exten¬ 
sions  to  logic  programming  by  incorporat¬ 
ing  mathematical  logic,  computer  algebra, 
deductive  databases,  and  artificial  intelli¬ 
gence. 

•  OPERA  (Laboratoire  de  Genie 
Informatique  de  Grenoble  and  CAP-Gemi- 
ni-Innovation,  France)  is  a  project  aiming 
at  efficient  inqilementation  of  a  parallel 
Prolog-like  language  that  would  exploit 
OR-parallelism.  It  is  designed  for  transpu¬ 
ter-based  systems  such  as  Supernode  plat¬ 
forms. 
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•  PEPSys  is  a  project  focused  on  practical 
solutions  to  parallel  logic  progranuning/^ 
It  combines  AND-  and  OR-oarallelism  and 
aims  at  solving  such  a  challenging  prob¬ 
lems  as  scalability  and  portability. 


Funcdonal  Languages 

Functional  programming  boosts  ease  of  pro¬ 
gramming  but  lacks  efficiency.  It  is  hoped  that 
efficiency  can  be  improved  with  the  advent  of 
parallel  architectures. 

Europeans  have  strong  traditions  and  the  lead 
in  several  areas  of  functional  languages  develop¬ 
ment.  The  early  work  dates  back  to  1970s  and 
includes  SASL  (St.  Andrews  Static  Language);^ 
ML  (Meta  Language)  KRC  (Kent  Recursive 
Calculator);^  SML  (Standard  ML);^®*“  and 
Miranda.^*  A  recent  standardization  effort  under¬ 
taken  by  researchers  in  Europe  and  America  pro¬ 
duced  a  new  functiomd  language  called  Haskell. 
Despite  an  impressive  number  of  features  and 
tremendous  expressive  power,  Haskell  suffers  from 
classical  inefficiencies  caused  by  memory  manage¬ 
ment  and  high  bandwidth  requirements.  It  is 
hoped  that  a  move  toward  parallel  processing  and 
vertical  migration  will  alleviate  at  least  some  of  the 
problems. 

It  should  be  noted  that  dataflow  languages  such 
as  MIT’s  ID,”  VAL,”  and  SISAL”  display  func¬ 
tional  language  characteristics.  Functional  lan¬ 
guages  for  parallelism  include  ParAlf,”  Caliban,” 
MultiLisp,”  and  Concurrent  Lisp.” 

Key  European  projects  include; 

1.  Parallel  Conqmter  Systems  for  Integrated 
Numeric  and  Symbolic  Processing.  The 
objective  of  this  ESPRIT  project  is  to 
develop  three  types  of  languages  and  exam¬ 
ine  their  ai^li^iiiiy  to  symbolic  and 
numeric  processing.  These  languages  are; 

•  D-Lisp  -  an  extended  ftinctional  language 
based  on  LeLisp, 

•  SOLVE  -  an  (Ajea-oriented  program¬ 
ming  language,  and 


•  SPINLOG  -  an  extended  logic  language 
based  on  C-Prolog. 

2.  PALAVDA  (Parallel  Architectures  and 
Languages  for  Advanced  Information  Pro¬ 
cessing)  is  also  an  ESPRIT  project.  It  has 
a  similar  goal  of  comparing  logic  and 
functional  languages.  They  are; 

•  K-LEAF  -  Logic  and  functional  language 
based  on  Horn  Clause  Logic, 

•  IDEAL  -  (an  Ideal  Deductive  and  Appli¬ 
cative  Language)  is  also  a  language  that 
includes  logic  and  functional  elements. 

3.  GRIP  is  a  shared-memory,  80-processor, 
parallel  graph  reduction  machine  for  effi¬ 
cient  execution  of  functional  languages.  It 
was  developed  at  the  University  College, 
London. 

4.  Caliban  is  a  language  that  facilitates  paral¬ 
lel  execution  on  specific  adjacent  proces¬ 
sors  and  can  be  used  in  both  parallel  and 
distributed  computing  environments. 

5.  FLAGSHIPS®®  aims  at  developing  a  pro¬ 
gramming  environment  for  distribute 
computations  that  facilitates  executi  n  of 
declarative  languajes. 

6.  PADMAVATI  (Parallel  Associative  Devel¬ 
opment  Machine  as  a  Vehicle  for  Artificial 
Intelligence)  is  an  ESPRIT  project  for 
developing  languages  for  a  parallel, 
transputer-based  computer. 

7.  Concurrem  CLEAN  is  a  functional  lan¬ 
guage  based  on  parallel  fimctional  graph 
rewriting.  It  is  being  developed  at  the 
University  of  Nymegen,  The  Netherlands, 
as  part  of  ESPRlT’s  Parallel  Compu  'ng 
Action. 

8.  Haskell  is  the  first  functional  language 
acc^ted  as  an  intemationai  standard.  It  is 
being  inqilemented  on  trans|»iter-based 
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parallel  coiiq>uters  at  the  Universities  of  Glas¬ 
gow  and  Southanq)ton. 

Conclusions 

Europe’s  outstanding  contributions  to  the  de¬ 
velopment  of  programming  languages  for  sequen¬ 
tial  and  parallel  architectures  are  unquestionable. 
The  European  Community  (EC)  supports  a  number 
of  comparative  analysis  projects  that  will  shed 
more  light  on  which  languages  will  be  most  suit¬ 
able  for  future  computer  generations.  Other  EC 
projects  may  result  in  the  development  of  a  new 
bre^  of  languages.  I  expect  that  new  languages 
will  have  a  balanced  mix  of  imperative  and  declar¬ 
ative  programming  methods  to  maximize  execution 
efficiency,  ease  of  programming,  and  reliability. 

In  fact,  researchers  are  beginning  to  realize  that 
more  similarities  may  exist  between  various  lan¬ 
guage  types  than  was  originally  thought. 

Despite  the  indisputable  lead  in  some  areas  of 
parallel  programming,  Europe  has  not  been  able  to 
capitalize  on  its  contributions.  There  are  multiple 
reasons  for  this.  Some  of  them  include; 

•  Cautious  approach  by  scientific  code  devel¬ 
opers  who  are  not  willing  to  make  consid¬ 
erable  investments  in  software  unless  a 
given  language  becomes  a  standard  and  is 
architecture-independent.  This  is  the  rea¬ 
son  that  Fortran  extensions  are  commer¬ 
cially  most  successful. 

•  Architecture-dependent  developments.  It 
seems  that  quite  a  few  languages  support 
transputer-based  architectures.  While 
concepmally  these  approaches  are  pure  and 
elegant,  they  fail  to  win  widespread  accep¬ 
tance  simply  because  of  lack  of  tools, 
optimizing  compilers,  and  wide  industrial 
support. 

Intemati  c'ul  standardization  efforts  are  badly 
needed.  The  O  en  Software  Foundation  and  the 
Parallel  Computing  Forum  are  good  catalysts  and 
vehicles  for  technology  transfer  and  stronger  inter¬ 
actions  between  American  and  European  research¬ 
ers.  Hicse  organizations  should  focus  on  develop¬ 
ment  of  standards.  The  experiences  of  Japanese 


researchers,  especially  in  logic  programming, 
should  also  be  shared. 

In  the  near  term,  accelerated  efforts  in  devel¬ 
oping  parallel  C  is  badly  needed.  In  the  long  term, 
object-oriented  languages  with  strong  logic  and 
functional  component  will  very  likely  dominate. 
The  IBM/ Apple  alliance  may  result  in  providing  a 
programming  language  for  all  of  us,  but  the  Tower 
of  Babel  will  not  cease  to  exist. 

It  seems  that  the  new  breed  of  languages,  i.e., 
application-specific  languages,  will  fiuorish— 
provided  application  experts  such  as  physicists, 
chemists,  biologists,  and  others  are  ready  and 
willing  to  explore  grand  challenges.  Language 
developers  should  work  closer  with  the  applications 
community  to  meet  these  challenges  for  the  1990s 
and  beyond. 

OPERATING  SYSTEMS 

Several  operating  systems  for  parallel  and 
distributed  computing  systems  have  been  or  are 
being  developed.  It  is  rather  mundane  to  describe 
all  of  them  in  detail;  most  of  them  have  very  simi¬ 
lar  features,  with  one  or  two  special  capabilities 
standing  out.  To  give  a  fair  representation,  two 
operating  systems,  PEACE  and  Chorus,  are  de¬ 
scribed.  PEACE  was  designed  for  MIMD  mes¬ 
sage-passing  architectures  and  is  an  experimental 
system  for  parallel  computers;  Chorus  is  a  highly 
advanced  commercial  system  for  distributed  com¬ 
puting  that  competes  on  par  with  a  variety  of 
UNIX-type  systems  including  Mach.  (Other  sys¬ 
tems  such  as  Amoeba  and  DELTA-4  have  been 
described  previously.) 

PEACE 

Synopsis 

PEACE  (Process  Executions  And  Communica¬ 
tion  Environment)  is  a  decentralized/distributed 
operating  system  that  provides  a  runtime  environ¬ 
ment  for  MIMD  message-passing  parallel  comput¬ 
ers  such  as  SUPERENUM  or  GENESIS.  It  was 
developed  by  GMD-FIRST  in  Berlin  and  sponsored 
by  the  German  Ministrj'  of  Research  and  Technolo¬ 
gy  (BMFT). 
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Software  Structure 

The  PEACE  operating  system  was  designed  in 
a  spirit  of  tailoring  the  requirements  to  super¬ 
computer  architectures  (such  as  SUPRENUM)  and 
carefully  examining  the  extensive  past  experience 
in  operating  systems  design. 

The  family-of-operating-systems  concept  im¬ 
plies  an  object-oriented  design  where  uniquely 
addressable  system  components  such  as  processes 
are  encapsulated.  Lightweight  processes  are 
grouped  into  teams  and  are  given  a  common  execu¬ 
tion  domain.  A  team  process  is  similar  to  a  tradi¬ 
tional  process.  The  difference  is  that  a  concurrent 
execution  of  services  within  a  team  is  feasible. 
Switching  between  processes  within  a  single  team 
takes  about  SO  microseconds  on  a  SUPRENUM 
system. 

Interprocess  communication  is  based  on  the 
remote  invocation  send  model  that  uses  lightweight 
processes  and  movefrom/moveto  primitives  to 
permit  files  of  arbitrary  size  to  be  transferred 
between  client  and  server  teams. 

The  message-passing  kernel  is  the  core  of  the 
PEACE  operating  system.  It  supports; 

•  team  scheduling, 

•  dispatching  of  lightweight  processes, 

•  interprocess  communication, 

•  low-level  device  management  (device  driv¬ 
er  via  FPC  and  trap/interrupt),  and 

•  RPC  interface  (present  on  each  host  in  a 
form  of  the  kernel  team). 

The  hierarchical  structure  of  PEACE  includes 
several  system  processes  that  provide  services  to 
higher  level  application  processes.  The  systems, 
which  are  quite  standard,  comprise;  name  server; 
name  replugger  (tor  matching  service  names  with 
servers);  process  server  and  address  space  server 
(for  process  and  address  nuuiagement);  MMU 
server  (for  handling  memory  management  units); 
team  server  (to  facilitate  interface  between  the 
processes  and  address  space  management  and  the 
application),  loader;  panic  server  (for  handling 
exceptional  events);  clock  server;  printer  server; 
net  server;  disk  server;  file  server;  and  signal 
server  (enables  propagation  of  specific  exceptions). 

PEACE'S  performance  is  impressive;  it  com¬ 
pares  well  with  some  of  the  fastest  message-passing 
systems  [such  as  Amoeba  and  V-system.  An  RPC 


(send-receive-reply  takes  about  1.2  ms  and  band¬ 
width  ranges  from  2.7  to  4.9  Mbytes/s  for 
movefrom/moveto  transfer  protocols. 

Conclusions 

PEACE  is  a  lightweight,  processes-based, 
synchronous  message-passing  operating  system, 
comparable  with  Amoeba  and  V  in  performance. 
Although  its  applicability  to  parallel  processing, 
especially  with  respect  to  portability  and  scalability 
ne^s  to  be  further  tested,  it  presents  an  interesting 
entry  to  the  family  of  operating  systems. 

For  further  information  contact; 

Wolfgang  Schroeder 
German  Institute  for  Mathematics 
and  Data  Processing 
GMD  FIRST 
Hardenbergplatz  2 
D-1000  Berlin  12 
Federal  Republic  of  Germany 

CHORUS 

Synopsis 

The  Chorus  family  of  distributed  operating 
system  products  integrates  open  systems  with  user- 
transparent  distributed  processing  and  some 
real-time  attributes. 

Chorus  Systems,  a  French  company  established 
in  1986  with  $8(X)K  in  start-up  capital,  is  emerging 
as  Europe’s  main  developer  of  distributed  operat¬ 
ing  systems.  Through  strategic  alliances  with  some 
U.S.  companies  such  as  UNISYS,  it  is  trying  to 
compete  with  UNIX  and  Mach.  Chorus  Systems  is 
aim^  at  UNIX-type  applications  in  business  and 
finance  as  well  as  at  real-time  systems  users  in 
aerospace,  telecommunications,  and  other  indus¬ 
tries.  Chorus  runs  on  a  variety  of  hardware  plat¬ 
forms,  ranging  from  personal  computers  to 
workstations. 

Key  characteristics  include  high  efficiency, 
high  performance,  modularity,  sin^licity,  ease  of 
distributed  programming,  openness,  high  availabili¬ 
ty,  enhanced  security,  support  for  standard  commu¬ 
nication  protocols,  and  extended  development  en¬ 
vironment  (communication  debugger  and  simulator 
for  prototyping). 
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Software  Architecture 

The  basic  philosophy  behind  the  Chorus  oper¬ 
ating  system  is  modularity  (which  facilitates  dy¬ 
namic  system  reconfiguration  for  use  with  a  wide 
range  of  applications)  and  development  of  a  small 
kernel.  This  kernel  supports  only  very  generic 
functions  (such  as  file  access  methods  and  system 
administration)  while  moving  all  other  services 
(which  can  be  more  efficiently  performed  outside) 
to  specific  modules. 

The  Chorus  architecture  is  composed  of  a 
small-sized  nucleus  and  number  of  system  servers. 
The  nucleus  integrates  computing  ai:d  communica¬ 
tion  at  the  lowest  level.  Chorus  operating  systems 
are  configured  as  sets  of  independent  system  serv¬ 
ers  that  rely  on  basic  services  provided  by  the 
nucleus.  These  services  include  thread  scheduling, 
interprocess  communication,  virtual  memory  man¬ 
agement,  and  event  timing. 

A  set  of  system  servers  forms  a  subsystem 
(e.g.,  UNIX)  that  provides  services  to  application 
programs  (users).  The  nucleus  uses  a  series  of 
abstractions  such  as  actors  (units  of  resource  distr 
butions/collection,  and  memor}'  address  space); 
threads  (units  of  sequential  execution);  messages; 
ports;  port  groups;  ’inique  identifiers;  and  regi/'ns 
(units  of  address  space  that  structure  an  actor). 
These  abstractions  represent  object  classes  that  are 
private  to  the  nucleus. 

Other  abstractions  managed  by  both  the  nucle¬ 
us  and  the  subsystem  actors  include  segments  (for 
data  encapsulation);  capabilities  (for  data  access 
control);  and  protection  identifier  (for  authentica¬ 
tion).  Actors  can  be  viewed  as  virtual  machines, 
while  threads  are  processes. 

The  Chorus  nucleus  can  be  installed  on  a 
variety  of  platforms,  ranging  from  a  parallel  com¬ 
puters  to  a  widely  distributed  network  of 
workstations.  UNIX  SYSTEM  V  and  other  operat¬ 
ing  systems  can  execute  on  top  of  the  Chorus 
nucleus. 

Chorus  is  written  in  C  +  +  (and  C)  and  sup¬ 
ports  Motorola  680X0  and  Intel  80386  based  ma¬ 
chines.  The  typical  Chorus  subsystem  has  standard 
UNIX  servers  (such  as  network  manager,  process 
manager,  file  manager,  device  manager,  and  select 
manager)  implemented  on  several  hardware  plat¬ 
forms. 


Conclusions 

Chorus  is  a  modem,  commercial-quality, 
distributed  operating  system  that  provides  modular¬ 
ity,  scalability,  portability,  and  most  importantly, 
simplicity.  It  is  a  good  candidate  to  carry  distrib¬ 
uted  computing  into  the  next  decade  where  integra¬ 
tion  of  computations  and  communications  (includ¬ 
ing  "computing  while  communicating")  will  be 
basic. 

For  further  information  contact: 

Hubert  Zimmermann 
Chorus  Systems 
6,  Avenue  Gustave  Eiffel 
F-78182,  Saint-Quentin-En-Yvelines 
France 

EUROPEAN  SOFTWARE  AND  SYSTEMS 
INITIATIVE  (ESSI) 

The  European  Software  and  Systems  Initiative 
(ESSI)  is  a  lOSM  ECU  ($140  million)  research  and 
development  program.  ESSI’s  main  objective  is 
improving  the  productivity  and  quality  of  software 
systems  program.  The  program  is  focused  on 
three  major  themes; 

1 .  Object-Oriented  Programming 

2.  Reuse 

3.  Training. 

Europeans  strongly  believe  that  the  next  gener¬ 
ation  of  programming  will  be  object-oriented.  This 
philosophy  dominates  in  industrial  circles  as  well 
as  in  parts  of  the  research  community.  In  the  U.S. 
also,  industry  is  moving  in  this  direction;  hence  the 
merger  of  Apple  and  IBM.  They  forged  an  alli¬ 
ance  with  the  goal  of  delivering  the  next  generation 
of  operating  systems  to  support  object-oriented 
programming.  ESSI  wants  to  become  a  vehicle  for 
aggressive  software  information  exchange;  it  wants 
to  establish  software  clearing  houses  for  tools  and 
software  modules,  especially  objects.  The  ultimate 
goal  is  to  create  a  European  library  of  software 
modules. 

The  other  two  themes,  reuse  and  training,  are 
more  conservative  and  will  focus  on  methods  of 
software  reuse,  standardization,  and  education.  A 
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research  component  will  include  tools  develop¬ 
ment,  application  experiments,  testing,  and  parallel 
programming  methods.  One  of  the  most  fascinat¬ 
ing  goals  is  the  idea  of  accelerated  quality  "feed¬ 
back.  "  The  interesting  idea,  which  at  this  time  is 
at  the  wishful  thinking  stage,  is  to  provide  an 
"instant"  quality  assessment  during  the  process  of 
software  development. 

The  reuse  activities  for  conventional  software, 
as  for  object-oriented  software,  will  concentrate  on 
software  information  exchange,  clearing  activities, 
and  the  establishment  of  software  module  libraries. 
Training  will  focus  on  tutorials,  workshops,  cours¬ 
es,  and  on  reaching  and  linking  user,*;,  vendors, 
and  developers. 

ESSI  is  rather  a  conservative  program,  with 
plans  to  spend  most  of  its  funding  on  evolution  (90 
percent  of  budget)  rather  than  revolution  (10  per¬ 
cent).  The  focus  is  on  topics  that  have  proven  to 
be  effective  in  commercial  practice.  ESSI  aims  at 
establishing  quality  standards  and  software  courses 
syllabuses  and  disseminating  modem  methods  and 
software  practices.  With  a  shortfall  of  about 
150,00  software  engineers  in  Europe,  training  is 
viewed  as  one  of  the  most  important  aspects  of  the 
program. 

Conclusions 

ESSI  is  a  low-risk  research  and  development 
program  with  significant  potential  for  being  suc¬ 
cessful.  There  is  a  growing  consensus  in  Europe 
that  its  potential  is  in  software  rather  than  in  hard¬ 
ware  where  large  capital  investments  would  be 
needed  to  match  the  U.S.  and  Japan.  Software 
requires  relatively  low  capital  and  the  payoff  can 
be  very  be  high.  ESSI  will  very  likely  provide  a 
valid  infrastmcture  for  quality  software,  good 
training  bases,  standards,  and  reuse  libraries.  The 
research  efforts  need  to  be  concentrated  and  it 
seems  that  a  creation  of  software  engineering  insti¬ 
tute  similar  to  the  Software  Engineering  Institute  in 
Pittsburgh  is  imminent. 

5.  CONCLUSIONS,  CHALLENGES, 

AND  DIRECTIONS 

Europe  is  a  major  force  in  computer  research; 
it  has  superb  scientists,  good  facilities,  and  acceler¬ 
ated  research  funding.  European  programs  such  as 


ESPRIT,  EUREKA,  and  RACE  will  help  to  put 
Europe  on  par  with  the  U.S.  and  Japan  in  comput¬ 
er  support  infrastructure  and  in  the  lead  in  certain 
areas. 

ESPRIT,  EUREKA,  and  RACE,  with  total 
funding  of  more  than  $20  billion  for  1985-1995, 
outline  a  coordinated  research  policy  in  computing 
and  communication.  On  one  hand,  such  a  policy 
for  diverse  Europe  is  badly  needed;  it  will  enforce 
unification  and  standardization.  On  the  other  hand, 
it  may  stifle  research  progress.  ESPRIT  ahd  other 
programs  seem  to  be  evolutionary  rather  than 
revolutionary.  At  this  time,  except  for  a  few  pro¬ 
jects,  most  research  goals  are  similar  to  those  ir 
the  U.S.  and  Japan. 

ESPRIT’S  strength  is  in  its  ability  to  facilitate 
cooperation  between  academia  and  industry;  its 
weakness  is  in  its  being  too  structured,  too  politi¬ 
cal,  and  too  even-handed.  Especially  on  big  pro¬ 
ject?  that  involve  20  or  so  participants,  a  lot  of 
energy  appiears  to  be  spent  on  coordination  rather 
than  cn  research.  Critics  say  ESPRIT  is  a  "social" 
program;  cynics  say  that  it  has  mainly  increased 
the  revenues  of  airline  companies.  However,  no 
one  can  deny  that  results  of  truly  international 
cooperation  are  evident,  and  European  researchers 
are  now  aware  of  each  others’  accomplishments  (in 
the  past  their  only  reference  point  was  research  in 
the  U.S.). 

There  are  a  number  of  low-profile  success 
stories— Europe’s  computer  research  is  not  about 
"chasing  GFL.OPS,"  it  is  mainly  based  on  long¬ 
term  commitment  to  principles.  Progress  and 
leadership  in  formal  methods,  parallel  program¬ 
ming,  safety,  and  some  parallel  applications  are 
well  known.  In  my  opinion,  Europe  should  focus 
on  software  technology  and  networking  and  have 
ASIC  developments  in  semiconductor  teclmology. 
Competing  in  the  microprocessor  and  memory 
markets  is  expensive,  and  challenges  are  mainly  of 
a  technological  nature. 

The  "global  village"  is  becoming  a  reality  in 
computer  research.  Computer  companies  are 
multinational  (the  case  of  a  chip  ordered  by  Philips 
(The  Netherlands),  designed  in  the  U.K.,  manufac¬ 
tured  in  Japan,  tested  in  Taiwan,  and  distributed 
worldwide  is  not  uncommon). 

The  globalization  of  computer  research  will 
give  advantages  to  countries  that  are  capable  of  fast 
technology  transfer  (from  this  perspective,  the 
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economic  miracle  in  Ja^ian  is  no  surprise;  technolo¬ 
gy  transfer  there  takes  two  years  less  thari  it  does 
in  the  U.S.}- 

Many  high-performance  computing  efforts  in 
Europe  measure  up  to  or  surpass  equivalent  efforts 
elsewhere.  In  terms  of  what  is  known  as  general- 
purpose  parallel  computing,  GP-MIMD,  GENE¬ 
SIS,  and  Supemode  machines  seem  to  be  promis¬ 
ing  and  interesting.  Close  cooperation  between 
academic,  microprocessor  manufacturer  (Irunos) 
and  system  houses  (Meiko,  Parsys,  and  Telmat) 
may  result  in  significant  progress  toward  standard¬ 
ization,  portability,  and  scalabiiity  in  parallel  com¬ 
puting.  Special-purpose  machines  such  as  the 
APEIOO  may  become  the  world’s  fastest  QCD 
computer  ,  while  vector  machines  such  as  AMT  and 
ASP  will  deliver  extraordinary  performance  for 
selected  applications,  especially  in  signal  process¬ 
ing.  EDS  may  become  one  of  the  most  competi¬ 
tive  database  machiiies,  and  projects  such  as 
Amoeba,  Delta-4,  MARS,  and  REX  contribute  to 
progress  in  efficient  and  reliable  distributed  com¬ 
puting.  In  addition,  the  REX  project  focuses  on 
management  policy,  which  is  becoming  increasing¬ 
ly  important  in  configurable  distributed  systems. 

These  are  some  selected  highlights  in  European 
systems  work,  but  Europe  also  has  a  lot  to  offer  in 
formal  methods,  modeling,  fault-tolerant  comput¬ 
ing,  multimedia,  simulation,  algorithms,  logic, 
languages,  operating  systems,  and  the  entire  array 
of  applications. 

Europe’s  strength  is  diversity;  its  weakness  is 
insufficient  drive  toward  implementations  and 
technology  transfer. 

European  high-performance  computing  will 
succeed  if  Europeans; 

•  focus  on  issues,  not  politics; 

•  concentrate  on  software  and  systems,  not 
on  hardware  con^ionents  (except  for  ASIC 
accelerators);  and 

•  open  up  and  accept  technological  globaliza¬ 
tion  by  becoming  partners  in  Joint  projects. 

Although  pockets  of  excellent  research  exist  in 
European  computer  science,  especially  theory,  it 
seems  unlikely  that  Europeans  will  be  first  in  a 
teraFLOPS  race,  but  they  will  impress  us  with: 


•  contributing  to  the  foundations  for  the  next 
generations  of  computing  (e.g.,  molecular 
computing); 

•  providing  standardized  infrastructure  for 
computerization  and  networking; 

•  innovative  applications  (telephone  services, 
multimedia,  smart  cards,  intelligent  badges, 
etc.);  and 

«  developing  and  leading  in  selected  applica¬ 
tions  among  grand  challenges  and  prob¬ 
lems. 

Let  us  assess  the  status  of  high-performance 
computing,  and  outline  challenges  by  examining 
steps  of  design  methodology,  as  shown  in  Table  7. 


Table  7— Design  Methodology  for 
Parallel  Systems  Design 

Application(s)  definition  (application  semantics) 

_ Algorithm(s)  development _ 

Computational  characteristics  specification 

Computational  model  specification 

_ Architecture  design _ 

Implementation 


Ideally,  computer  architectures  should  be 
ariven  by  applications,  but  often  the  reverse  hap¬ 
pens.  Computer  architects  frequently  design  their 
systems  in  a  vacuum,  detached  from  applications 
that  they  usually  do  not  understand,  and  with  major 
concerns  regarding  performance  measured  in  fre¬ 
quently  useless  MFLOPS,  GLOPS,  and  the  likes. 
The  major  challenge  today  is  to  reverse  this  trend 
and  close  the  gap  between  computer  architects  and 
scientists. 

An  application  semantics  language  is  needed 
that  would  capture  the  computational  characteris¬ 
tics  of  an  application.  Such  a  family  of  languages, 
together  with  translators  or  compilers,  must  be 
developed.  The  main  bottleneck  exists  in  computa¬ 
tional  models.  Since  Flynn’s  taxonomy  and 
Valiant’s  PRAM  and  Bulk  Synchronous  Parallel 


5.2 


E8NIB  92-08 


models,  very  little  has  lu^pened  in  modeling  paral- 
lel/distribut^  computing.  Models  are  critical  for 
deigning  and  developing  new  generations  of  com¬ 
puter  architectures.  Today’s  architectures  are 
driven  by  technology  and  network  topologies. 
Pertu^s  more  than  150  parallel  systems  were  built, 
only  a  few  were  significant  in  concepts  and  contri¬ 
butions.  Tliis  process  is  long  and  expensive.  The 
creation  of  adequate  models  would  permit  simula¬ 
tion  and  faster  development  of  machines  that  would 
suppou  the  new  generation  of  application  semantics 
languages. 

The  U.S.  High-Perfonriance  Conq)uting  pro¬ 
gram  gives  an  excellent  opportunity  to  support 
these  types  of  innovative  efforts.  To  make  parallel 
conqiuting  viable  and  commercially  successful,  one 
of  the  following  must  hiqipen: 

1.  A  "killer"  application,  equivalent  to  word 
processing,  spreadsheets,  or  databases  in 
personal  conqiuters,  must  be  developed;  or 

2.  Concentrated  efforts  in  developing  applica¬ 
tion  semantics  languages,  conqiutational 
characteristics,  and  models  must  be  pur¬ 
sued. 

It  is  unavoidable  that  new  computers  will 
feature  hybrid  technologies  (such  as  optical,  neural, 
molecular  computing)  and  will  ultimately  use  "ffee 
space"  communication  technology  to  support  traffic 
requirements,  even  for  the  most  demanding  appli¬ 
cations. 

The  technology  will  continue  its  progress;  by 
the  end  of  this  decwle  we  should  expect  GFLOPS 
microprocessors.  Gigabit  memory  chips.  Terabit 
optical  diskettes,  and  Terabit/s  communication 
links.  The  ultimate  challenge  will  be  to  harness 
this  incredible  power  by  developing  a  new  genera¬ 
tion  of  parallel  architectures,  models,  languages, 
and  sof^are. 
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OTTRODUCTION 

Con^utational  fluid  dynamics  (CFD)  problems 
are  very  large.  Solving  a  flow  problem  for  indus¬ 
trial  {qrplications  from  only  basic  principles  is 
in^mssible,  even  on  tomorrow’s  fast  computers. 
Many  approximations  and  models  are  necessary  to 
make  problems  tractable.  This  large  {^petite  for 
conq)uting  power  is  what  attracts  fluid  modelers  to 
parallel  confuting. 

Application  calculations  in  CFD  are  used  to 
make  operational  or  design  decisions.  Application 
work  is  usually  distinguished  from  research  work 
in  that  the  geometries  and  flows  are  much  more 
complex;  this  conqjlexity  requires  larger  comput¬ 
ers.  The  results  must  also  be  timely,  which  re¬ 
quires  faster  computers.  For  exan^le,  weather 
prediction  at  a  rate  of  one  prediction  day  per  day  is 
too  slow.  Design  calculations  are  also  time  critical 
because  a  large  matrix  of  alternatives  is  usually 
requited,  rather  than  one  or  two  demonstration 
predictions.  Weekend  turnaround  of  a  job  is  not 
sufficient  for  q)plication  work. 

Over  the  past  thrf's  decades,  available  comput¬ 
er  performance  has  increased  steadily.  CFD  pro¬ 
grammers  have  been  able  to  move  their  Fortran 
code  from  one  machine  to  another  with  little  diffi¬ 


culty.  The  biggest  change  that  has  taken  place  in 
the  development  process  with  respect  to  numerical 
modelers  was  moving  from  serial  to  vector  ma¬ 
chines.  In  order  to  tap  performance,  CFD  pro¬ 
grammers  had  to  take  these  vector  machines  into 
consideration.  However,  much  of  the  vectorization 
was  handled  by  the  compiler.  Another  change  is 
from  vector  machines  to  parallel  machines.  The 
slow  pace  this  change  is  taking  is  an  indication  of 
its  size  and  complexity. 

Producing  efficient  predictions  requires  consid¬ 
eration  of  six  components; 

•  Computers 

•  Models 

•  Algorithms 

•  Computer  Languages 

•  Graphics 

•  Organization. 

Ail  of  these  parts  must  fit  together  for  the  whole 
system  to  work  efficiently.  If  the  co^^}uter  is 
changed,  how  many  of  the  other  components  need 
modifying  or  replacing?  Languages  and  algorithms 
are  closely  coupled  to  the  computer  type.  A  loose¬ 
ly  coupled  organization  that  was  sufficient  for 
operating  with  serial  machines  may  not  be  suffi- 
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cient  with  large  parallel  machines.  If  most  compo¬ 
nents  change,  we  have  a  revolution.  If  old  compo¬ 
nents  and  processes  can  be  used  in  the  new  system, 
then  we  can  have  evolution.  Let  us  look  at  some 
European  examples  of  CFD  components  associated 
with  parallel  machines. 

COMPONENTS  OF  AN  APPLICATION 

Computers 

Most  large  application  work  is  currently  done 
on  CRAYs.  An  exception  is  the  work  of  Dale 
King  at  British  Aerospace.  He  used  a  transputer- 
based  parallel  machine  for  Airbus  flow  predictions. 
In  tlie  near  future  many  organizations  in  Europe 
will  be  considering  parallel  computers  for  applica¬ 
tion  work;  candidates  include: 

•  new  CRAY  with  more  processors, 

•  improved  Intel  i860  with  faster  communica¬ 
tions,  and 

•  transputer  machines  based  on  the  promised 
79000  processor. 

These  machines  will  be  the  next  step  in  evolution¬ 
ary  development. 

Selection  criteria  for  new  computers  for  appli¬ 
cations  is  much  more  conservative  than  criteria  for 
computers  used  for  research— the  price  of  failure 
on  applications  is  higher.  Therefore,  managers  do 
not  like  to  take  large  steps  on  unknown  territory. 

An  example  of  an  evolutionary  (step  by  step) 
approach  in  selecting  parallel  machines  for  applica¬ 
tions  comes  from  ONERA  (National  Institute  for 
Aerospace  Research  and  Studies).  It  is  an  organi¬ 
zation  in  France’s  Ministry  of  Defense  that  pro¬ 
vides  engineering  support  for  SNECMA,  Matra, 
and  Aerospatiale;  work  and  goals  are  application 
oriented.  Dr.  Pierre  Leca  of  the  ONERA  comput¬ 
er  science  department  stated  that  his  goal  was  to  be 
doing  applications  on  massively  parallel  computers 
by  1993.  To  accomplish  this  task,  he  works  close¬ 
ly  with  computer  manufacturers,  CFD  engineers, 
^  software  developers. 

He  has  been  doing  research  with  parallel  com¬ 
puters  for  1 1  years,  gaining  experience  with  sever¬ 
al  different  machines.  Early  in  the  game  he  decid¬ 
ed  MIMD  computers  were  the  best  for  these  appli¬ 
cations;  he  tried  transputers.  Since  1988  Leca  has 


been  working  with  Intel  parallel  computers.  His 
present  one  has  128  processors.  He  compares  its 
performance  with  their  Cray  YMP  by  running 
competing  algorithms  for  many  flow  classifica¬ 
tions.  Both  machines  can  be  classified  as  conserva¬ 
tive.  Both  companies  have  huge  computer  devel¬ 
opment  resources,  mature  software  tools,  and 
existing  processors. 

In  the  U.K.  we  find  examples  of  many  differ¬ 
ent  machines  in  the  universities.  However,  the 
biggest  emphasis  has  been  on  transputers.  Step¬ 
ping  forward  with  the  transputer  was  more  revolu¬ 
tionary  than  development  of  machines  with  existing 
processors.  The  main  disadvantage  of  transputers 
for  application  work  is  their  lack  of  computational 
power.  Many  processors  must  be  used  to  obtain 
competitive  performance.  The  University  of  Edin¬ 
burgh  uses  a  Meiko  computer  with  more  than  400 
transputers  for  their  application  work. 

Models 

The  development  of  models  that  describe  the 
physics  of  the  problem  is  currently  the  highest 
priority  of  most  engineering  organizations  in  Eu¬ 
rope  and  America.  Differencing  schemes  and 
parallel  algorithms  usually  have  lower  priorities. 
Fortunately,  most  physics  models  added  on  top  of 
a  CFD  calculation  are  easy  to  do  in  parallel. 

Modeling  the  physics  is  still  the  biggest  prob¬ 
lem  in  ocean  circulation  modeling  and  weather 
prediction.  David  Dent  of  the  European  Center  for 
Medium-Range  Weather  Forecasts  in  Reading, 
U.K.,  reports  that  as  more  computer  power  be¬ 
comes  available,  it  will  be  devoted  to  physics, 
rather  than  to  increased  grid  size 

The  Admiralty  Research  Establishment  does 
ocean  modeling  to  improve  the  application  of  sonar 
to  the  fleet.  Dr.  A.  D.  Heathershaw'  at  ARE 
Portland  selects  physical  models  that  fit  parallel 
machines.  Approximations  are  made  to  best  utilize 
die  machines  they  have  available.  This  type  of 
model  selection  is  the  next  evolutionary  step  in 
applying  models  on  parallel  machines. 

Algorithms 

Application  codes  are  currently  evolving  as 
more  processors  are  added  to  the  latest  Crays. 
When  these  codes  are  moved  to  more  parallel 
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con:q>uters,  the  algorithms  may  require  major 
changes. 

CFD  covers  a  broad  range  of  problem  types. 
Three  types  (elliptic,  parabolic,  and  hyperbolic) 
may  require  very  different  algorithms,  which  will 
have  different  efficiencies  on  various  parallel  ma¬ 
chines.  The  hyperbolic  problems  may  be  efficient 
with  simple  explicit  schemes  with  local  coupling. 
Elliptic  problems  need  good  global  communication. 
The  len^  and  time  scales  of  parabolic  problems 
will  dictate  the  appropriate  algorithm.  However, 
global  coupling  is  still  dominant  for  most  prob¬ 
lems. 

Domain  decomposition  is  one  of  the  most 
popular  methods  of  making  codes  parallel.  All 
organizations  mentioned  here  use  domain  decompo¬ 
sition.  The  British  Aerospace  Euler  code,  which 
has  a  multiblock  grid,  is  a  good  example.  These 
blocks  form  the  basis  of  the  domains;  the  number 
of  blocks  range  from  200  to  2000.  The  number  of 
processors  is  about  100.  The  parallel  efficiency  of 
domain  decomposition  can  be  high;  however  do¬ 
main  decomposition  reduces  global  coupling,  there¬ 
by  lowering  the  solution  efficiency,  i^atever  the 
combination  of  solution  and  parallel  efficiency,  the 
"problem"  efficiency  is  the  important  efficiency. 

Multigrid  algorithms  provide  good  global 
coupling  for  elliptic  problems.  They  have  good 
solution  efficiency  and  are  an  order  of  magnitude 
faster  than  single-grid  algorithms  in  most  cases. 
Fortunately,  multigrid  algorithms  parallelize,  but 
not  without  difficulty.  Dr.  Leca  pointed  out  that 
the  solution  efficiency  of  multigrid  is  so  high  that 
they  will  accept  the  small  parallel  inefficiency. 
Problem  efficiencies  are  not  well  defined.  This 
handlers  comparisons  of  machines  and  algorithms. 

As  workers  experiment  with  parallel  algo¬ 
rithms,  they  have  been  forced  to  think  in  new  ways 
about  old  problems.  Now  the  parallelism  in  the 
flow  problem  must  be  carefully  considered;  some- 
tintts  this  process  leads  to  deeper  insights  to  the 
problem.  For  example,  multigrid  algorithms  are 
being  pursued  at  many  research  centers  for  parallel 
applications.  Dr.  Achi  Brandt^  reports  that  poor 
convergence  rates  of  recirculating  flow  problems 
with  multigrid  methods  were  traced  back  to  bad 
discretizations.  The  discretizations  had  been  in 
comnmn  use  for  years,  but  the  new  solution  meth¬ 
od  helped  uncover  their  trouble.  He  also  points 
out  that  problems  show  up  in  parallel  efficiency 


when  the  physical  characteristic  lines  are  not 
aligned  with  the  numerical  grid  (the  real  problems 
are  numerical  modeling,  not  parallel  solution).  In 
another  example  of  searching  for  parallelism  in  the 
problem,  Rogier  and  Schneider^  at  ONERA  traded 
a  Monte-Carlo  method  with  its  poor  load  balancing 
for  a  deterministic  method  to  solve  the  Boltzmann 
equation. 

Progranuning  Language 

A  computer  language  is  needed  to  describe  the 
CFD  algorithm  to  the  machine.  The  more  parallel 
the  algorithm,  the  more  parallel  the  language  re¬ 
quirement.  Languages  for  parallel  computers  can 
be  classified  as: 

•  conventional  (requires  a  revolutionary  com¬ 
piler); 

•  conventional  with  parallel  extensions  (the 
evolutionary  approach);  and 

•  parallel  (the  revolutionary  approach). 

Dr.  Keith  Taylor  at  Liverpool  University  uses  both 
parallel  extensions  and  a  parallel  language 
(Occam).  He  said  that  Occam  has  simple  syntax 
and  that  using  Occam  is  the  easiest  way  to  leam 
about  parallel  computers  because  it  is  a  good  de¬ 
sign  language  in  which  to  design  algorithms.  3L 
Fortran  is  also  available  on  transputers  and  works 
quite  well. 

ONERA  CFD  programmers  use  the  evolution¬ 
ary  approach— conventional  language  with  exten¬ 
sions.  They  program  their  Intel  iPSC860  in  For¬ 
tran  with  extensions  to  handle  communication 
among  processors.  Fortran  is  still  the  CFD  stan¬ 
dard.  BLAS  libraries  are  becoming  widely  used. 

The  commercial  CFD  code  developers  I  talked 
with  are  not  actively  putting  their  own  money  into 
massively  parallel  systems.  However,  a  good  point 
was  made  by  Dr.  R.  Johns  of  Ricardo.  He  said 
that  mature  compilers  and  good  software  are  essen¬ 
tial  for  CFD  code  development.  Professor  D.  B. 
Spalding  of  CHAM  also  is  keenly  interested  in 
reliability  and  step-by-step  development  to  produce 
error-free  working  codes.  These  characteristics  are 
not  achieved  with  revolutionary  products. 

Portability  of  Fortran  is  one  reason  workers  in 
small  organizations  can  exist.  They  can  develop 
code  on  a  workstation  and  easily  move  it  to  a 
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for  application  work.  This  process  does  not  exist 
for  parallel  machines  today. 

Grafdiics 

Gny>hics  for  managing  the  input  and  output  of 
the  main  code  is  a  vital  part  of  computational  fluid 
dynamics  because  the  amount  of  data  to  be  evaluat¬ 
ed  is  so  large.  Gr^yihics  algorithms  are  readily 
arallelized  with  domain  decomposition,  they  lack 
.e  difficult  parallelization  of  the  CFD  flow  solv¬ 
ers.  Researchers  at  the  University  of  Edinburgh 
have  had  much  success  in  using  massively  parallel 
transputers  for  graphics.  ONERA  uses  in-house 
packages  running  directly  on  the  Cray. 

Organization 

Organization  is  the  glue  that  holds  the  other 
components  together.  A  tighter,  bigger,  and  more 
experienced  organization  is  required  for  a  parallel 
system  than  a  serial  system: 

•  Tighter  —  because  more  cooperation  is 
needed  to  obtain  efficiency  from  these  coupled 
conqxments.  It  will  be  difficult  for  small  or  loose 
organizations  to  make  progress  on  parallel  CFO 
^iplications.  The  most  progress  is  being  made  in 
large  organizations  like  ONERA.  Collaboration 
will  be  important  for  small  operations.  This  year, 
four  colleges  in  London  started  the  l^ondon  Parallel 
Applications  Center  to  aid  the  collaboration  be¬ 
tween  industry  and  the  colleges. 

•  Bigger  —  because  more  functions  may  have 
to  be  done  in-house  since  there  is  little  portability. 
Parallel  programs  are  larger  and  require  more 
considerations  of  the  computer  architecture. 
Therefore,  more  progrunmers  will  be  needed. 

•  More  G^erienced  —  because  the  initial 
decisions  about  conqniters,  languages,  and  algo¬ 
rithms  are  difficult  to  extrapolate  from  the  old 
systems.  ONERA  has  1 1  years  experience  in 
experimenting  with  CFD  algorithms  on  different 
pi^lel  machines.  Hierelbie,  their  next  ttep  will 
not  be  as  large  (evolution)  as  a  group  jumping 
from  a  Cray  to  a  new  parallel  system. 


CLOSURE 

•  Conservatism  in  respect  to  CFD  application 
work  is  producing  evolutionary  progress 
toward  use  of  parallel  computers. 

•  It  may  be  beneficial  for  American  organiza¬ 
tions  to  cooperate  with  Europeans  in  order 
to  build  on  their  experience. 

•  Algorithm  developments  are  yielding  sub¬ 
stantial  speed  improvements. 

•  Collaboration  is  needed  to  increase  the 
effective  size  of  organizations  and  to  make 
parallel  computers  available  to  more  small 
developers. 

•  Only  large  problems  (top  end)  will  be  done 
on  parallel  machines.  High-powered 
workstations  take  care  of  the  middle  ground. 
This  hampers  the  migration  of  codes  from 
workstations  to  parallel  mainframes. 

•  CFD  methods  need  better  measures  of  per¬ 
formance  so  we  can  conqjare  "problem" 
efficiencies  among  machines  and  algorithms. 

•  A  portable  parallel  language  would  speed 
the  evolution  to  parallel  systems.  Fortran 
with  extensions  is  presently  the  most  porta¬ 
ble  language. 

•  Revolutionary  computers  are  less  likely  to 
be  used  for  applications  in  the  near  future 
than  machines  that  evolve  from  proven 
designs. 
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"HOT-SPOT"  FOCUS 

Activities  associated  with  a  harbor  or  site  may 
become  increasingly  encompassed  in  a  wider  geo¬ 
graphic  consortium  to  improve  the  environment. 
This  geognq)hic  (or  "hot-spot")  focus  is  an  emerg¬ 
ing  European  concept  that  is  being  demonstrated  in 
the  Unit^  Kingdom  (U.K).  A  direct  and  appeal¬ 
ing  characteristic  of  these  consortia  is  that  people 
and  organizations  readily  identify  with  a  particular 
location  or  region.  The  concept  of  in:q)roving  the 
environment  and  conserving  natural  resources  is 
often  strong  in  a  specific  area  of  a  country  or  con- 
tinrat,  hence  motivation  is  strong. 

General  characteristics  of  a  geographic  envi¬ 
ronmental  improvement  consortium  include: 

•  large  and  small  industry  working  together: 

•  divrne  industrial  categories  involved; 

•  nonregulatory  initiative; 

•  die  entire  regional  system  of  inputs  is 
considered: 

•  large  organizations  assume  greater  fiscal 


and  technological  responsibility  (essentially 
a  leadership  role); 

•  represents  a  significant  demonstration  of 
^  industry  commitment  to  the  environment 

outside  individual  plants  or  sites; 

•  central  management  (university,  consul¬ 
tant,  or  nonprofit); 

•  joint  funding  (industry,  government,  and 
foundations);  and 

•  commitment  to  collecting  economic  and 
engineering  information. 

GEOGRAPHIC  INITIATIVES 

Aire  and  Calder  Rivers 

Geographic  initiatives  in  the  U.K.  offer  an 
emerging  European  picture  of  a  concept  that  may 
anticipate  similar  environmental  activity  in  the 
U  S.  In  the  U.K.,  one  of  the  first  geographic 
projects  is  in  the  North  on  the  Aire  and  Calder 
rivers.  These  rivers  begin  in  the  Pennine  Moun¬ 
tains  and  flow  east  about  12  km  to  the  North  Sea. 
On  these  rivers  are  industr)'— and  the  city  of 
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Leeds,  well  known  in  the  past  for  the  woolen 
trade,  and  now  for  textiles,  chemicals,  furniture, 
and  ferro-concrete  manufacturing.  Nearby  are  coal 
fields.  The  rivers  merge  and  flow  into  the  harbor 
at  Hull,  the  third  largest  port  in  the  U.K.,  where 
goods  are  exported  to  Sc^inavia,  the  Confedera¬ 
tion  of  Independent  States  (CIS),  and  Canada. 

This  river  basin  project  has  a  wastewater  emphasis 
and  is  particularly  directed  at  cleaner  technology 
implementation. 

The  U.K.  demonstration  on  the  Aire  and  Cal- 
der  rivers  is  focused  on  waste  minimization  (clean¬ 
er  production)  and  initially  involves  firms  such  as 
Allied  Colloids,  Coca  Cola  Schweppes,  Du  Pont 
Howson,  Hickson  &  Welch,  Lambson  Specialty 
Chemicals,  Lyons  Bakeries,  Sandoz  Chemicals, 
and  Yorlcshire  Chemicals.  The  budget  is  approxi¬ 
mately  £200,000  per  year  for  a  two-year  period 
(based  on  $1.95/£  in  July  1992).  Part  of  these 
resources  are  contributed  by  Her  Majesty’s  Inspec¬ 
torate  of  Pollution  (HMIP),  the  National  Rivers 
Authority  (NRA),  Yorkshire  Water  Services  (a 
local  agency),  and  the  BOC  Ltd.  Foundation  for 
the  Environment  and  Community. 

Mersey  River 

A  second  such  geographic  scheme,  also  focus¬ 
ing  on  pollution  prevention,  is  being  developed  on 
the  Mersey  River.  Also  originating  in  the  Pennine 
Mountains,  the  Mersey  flows  west  about  60  km 
into  the  Atlantic  Ocean.  Large  industrial  input 
occurs  near  Liverpool,  where  there  is  manufactur¬ 
ing  in  chemicals,  sugar  reHning,  flour/grain,  and 
seed  oil.  (Industry  began  here  in  A.D.  1200  with 
fishing.)  The  harbor  for  the  Mersey  is  now  the 
largest  Atlantic  port  in  Europe  and  the  second 
largest  port  in  the  U.K.  This  northern  industrial 
region  will  be  a  geogriqihic  consortium  involving 
large  and  small  groups;  emphasis  will  also  be  on 
waste  minimization  and  wastewater  discharges. 

CONCLUSIONS 

For  the  Navy  and  Department  of  Defense 
(DoD),  some  in^rtant  trends  and  opportunities 


are  emerging  from  regional,  mutual  assistance 
concepts  such  as  these  in  the  U.K.  Some  of  these 
issues  are: 

•  harbors  and  input  to  harbors  may  become 
integral  parts  of  a  broader  geographic 
approach; 

•  opportunities  will  occur  to  provide  active 
leadership  to  assure  broadly  based  efforts 
and  hence  improve  the  likelihood  of  suc¬ 
cess; 

•  a  multimedia  concept  may  also  emerge  that 
would  expand  the  current  water-based 
focus  of  a  harbors  program  to  include 
assessing  terrestrial  impact; 

•  with  the  geographic  approach,  large  orga¬ 
nizations  (such  as  the  Navy)  may  be  asked 
to  assume  fiscal  as  well  as  technical  re¬ 
sponsibility  for  emerging  geographic  con¬ 
sortia. 

For  the  DoD,  the  opportunities  and  constraints 
can  be  similarly  described  for  Army  facilities, 

Aimy  Corps  of  Engineers  activities.  Air  Force 
bases,  etc.  At  this  time,  the  opportunity  to  orga¬ 
nize  and  stimulate  such  a  wider  geographic  consor¬ 
tium  in  the  U.S.  could  be  of  substantial  benefit. 
With  such  an  emphasis,  the  progress  of  the  group 
of  industries  is  given  more  weight.  The  communi¬ 
ty  and  public  thus  have  a  broader,  shared  view  of 
the  needs  and  progress  in  their  geographic  environ¬ 
ment.  These  evolving  concepts  could  well  be 
important  models  for  DoD  evaluation. 


Point  of  Contact 

Mr.  Douglas  Robinson 
Department  of  Trade  and  Industry 
Environment  Unit 
151  Buckingham  Palace  Road 
London  SWIW  9SS 
United  Kingdom 
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Research  and  Development  in  the  Abbey  — 
Polymer  Processing  and  Properties 

i  •  ' 

by  Joseph  H.  Magill,  Liaison  Scientist  for  Polymeric  Materials  for  the  Office  of  Naval 
Research  European  Office.  Dr.  MagiH  joined  ONR  Europe  from  the  University  of 
Pittsburgh,  Pennsylvania,  where  he  held  Professorships  jointly  in  Materials  Science  and 
Engineering  and  in  Chemical  and  Petroleum  Engineering. 

Keywords:  reinforcement;  characterizations;  morphology;  synthesis;  modeling 


INTRODUCTION 

For  a  decade,  the  13th  century  Rolduc  Abbey, 
in  Kerkrade,  The  Netherlands,  has  been  the  site  of 
conferences  on  integrating  fundamental  science  and 
technology.  The  latest  conference,  "Optimizing 
Production,  Processing  and  Properties  of  Poly¬ 
mers,"  held  May  10-14,  1992,  comprised  lectures 
and  posters  that  ranged  from  theoretical  to  practi¬ 
cal. 

The  conference  was  opened  by  the  Director  of 
the  Dutch  State  Mines  Corporation  (DSM)  who 
stressed  that  any  "materials  science  group  of  the 
future  must  involve  polymers"  to  be  worthy  of  its 
name.  He  also  mentioned  the  serious  problem  of 
"plastic  waste"— a  topic  that  he  regretted  was  not 
featured  in  this  meeting.  Since  many  presentations 
were  made,  I  describe  only  salient  items. 

PROPERTIES 

Professor  Rolf  MQlhaupt  (Materials  Research 
Institute,  Freiburg)  spoke  on  "Self-reinforced  Poly¬ 
mer  Microcomposites."  He  en^hasized  the  basic 
idea  behind  classical  macroscopic  reinforcement, 
shifting  the  focus  of  his  talk  toward  n'fwer  tech¬ 
niques  now  being  used  in  advanced  microcomposite 
materials.  Here  he  stressed  that  two  major  types; 

1.  rigid-rod  liquid  crystalline  polymers  with 
flexible-coil  polymers,  and 

2.  dual  functional  monomers 


can  be  converted  in  minutes  through  reactive  injec¬ 
tion  molding  (RIM)  technology  into  isotropic  and 
anisotropic  microphases  dispersed  in  a  polymer 
matrix. 

Reinforcement  here  occurs  at  the  molecular  or 
supramolecular  level,  avoiding  the  conventional 
processing  problems  of  intractibility.  Benefits 
were  deriv^  also  from  morphological  control 
induced  via  processing,  thus  substantially  enhanc¬ 
ing  material  stiffness  and  strength.  He  provided 
well-documented  chemical  and  physical  character¬ 
izations  of  these  polymers,  describing  as  an  exam¬ 
ple  polyols  that  are  new  intermediates  for  the 
production  of  polyurethene  microcomposites.  Here 
for  example,  stable  dispersions  of  anisotropic 
"polyamide  particles"  with  aspect  ratio  5-10:1  are 
created.  The  importance  of  interfacial  adhesion 
and/or  bonding  was  stressed  as  a  key  point  for 
enhancing  mechanical  properties. 

Dr.  John  Moalli  (Dept,  of  Materials  Science  & 
Engineering,  MIT)  described  the  "Mechanical 
Behavior  of  High  Performance  Polymer  Fibers. " 

He  devised  a  new  technique  to  evaluate  the  trans¬ 
verse  strength  of  simple  fibers  in  term  of  their 
flexural  stiffness  and  compressive  modulus.  Nota¬ 
bly  the  compressive  modulus  is  always  found  to  be 
lower  than  the  tensile  modulus  in  materials. 

Moalli  reported  that  physical  vapor  deposition  of 
aluminum  oxide  considerably  enhanced  the  com¬ 
pressive  strength  and  also  reduced  the  fiber 
radialthermal  expansion  by  a  factor  x  2.  Howev¬ 
er,  in  the  writer’s  opinion,  the  procedure  although 
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novel,  does  not  appear  to  be  economically  viable  at 
this  time. 

In  discussing  the  paramount  factors  (still  debat¬ 
able)  that  govern  fracture,  deformation,  and  ulti¬ 
mate  ductility  of  polymeric  systems,  MCM  van  der 
Sander  (Center  for  Polymer  &  Composites,  Uni¬ 
versity  of  Eindhoven,  The  Netherlands)  referred  to 
the  important  contribution  of  Kramer  et  al.‘  and 
Wu^  in  this  field.  He  focused  on  the  deformation 
behavior  of  polystyrene  (PS)  and  blends  of  PS  with 
poly  (2,6-  dimethyl- 1,4-phenylene  ether)(PC). 
Notably,  the  strain  to  break  was  raised  to  its  esti¬ 
mated  theoretical  value  in  PS  films  (<  O.S/tm 
thick)  or  in  PS  if  it  was  filled  with  a  high  volume 
of  tiny  particles  ("holes").  It  was  claimed  that 
"connections"  between  these  interstices  (crazes) 
sustain  the  applied  stiesses  during  stretching,  with¬ 
out  premature  crack  formation  of  the  "network"! 

In  practice,  PS  is  macroscopically  brittle  at  <  S 
percent,  whereas  PC  undergoes  strain-at-break  up 
to  approximately  ISO  percent. 

"Block  Copolymers  of  Polyamides"  were 
discussed  by  Dr.  J.  Roda  (Institute  of  Chemical 
Technology,  Prague,  Czechoslovakia).  In  these 
polymers  produced  by  RIM  technology,  the  hard 
segment  was  a  polyamide  6(N6)  and  the  soft  seg¬ 
ment  was  polyether,  polyesters,  or  diene  elasto¬ 
mers.  The  objective  of  polyesters  or  diene  elasto¬ 
mers  in  this  work,  was  to  establish  optimal  condi¬ 
tions  (synthetic  and  chemical)  for  maximum  tough¬ 
ness,  which  was  then  assessed  by  notch  impact  and 
other  tests.  In  several  systems,  specifically  N6 
with  increasing  polybutadiene  (PB)  concentration, 
the  elongation  at  break  was  found  to  increase  to 
100  percent  at  an  optimal  level  of  10-15  percent 
PB  where  the  elongation  corresponded  to  a  tough¬ 
ness  enhancement  of  one  order.  In  general,  it  was 
found  that  most  of  the  structures  used  in  these 
block  copolymer  model  studies  did  not  comply 
with  theory,  but  they  could  be  attained  via 
kinetically  controlled  polymerization. 

Professor  Robert  P.  Burford  (Chemical  Engi¬ 
neering/Industrial  Chemistry)  University  of  New 
South  Wales,  Australia,  discussed  "Environmental 
Issues  of  Cryoffactured  Elastomers."  Cryogrind- 
ing  was  found  to  be  advantageous  over  other  recy¬ 
cling  techniques;  it  was  claimed  to  enable  rubber 
(for  example)  to  be  recycled  as  small  particles  10 
^tm  to  2  mm  in  diameter.  The  ease  of  grinding  is 


related  to  the  fracture  toughness  of  the  elastomer 
defined  as  Gic  (strain  energy  release  rate).  Al¬ 
though  molecular  characterization  was  not  provided 
after  cryogrinding,  morphological  characterization 
was  made  of  cryofractured  elastomers  below  Tg. 

In  PBs  there  was  highlighted,  under  the  scanning 
electron  microscope  (SEM),  highly  extended 
crazed  matter  that  appeared  to  exhibit  nodular 
structures  with  particles  1-3  fim  in  diameter. 

An  interesting  presentation  was  given  by  Dr. 
G.R.  Smith  and  his  associates  (Allied  Signal  Inc., 
Corporate  Research  and  Technology,  Morristov/n, 
NJ)  on  "Electro-optical  Polymers  and  Wave-grind¬ 
ing  Devices."  Dr.  Smith  reviewed  the  basic  prin¬ 
ciples  and  methodology  associated  with  electro- 
optical  response.  He  highlighted  selected  polymers 
and  pointed  out  that  polymers  with  non-centrosym- 
metric  structures  were  ideal  materials  for  the  trans¬ 
mission  of  planar  guided  optical  waves.  Smith 
cited  many  exanqiles  of  the  interesting  work  per¬ 
formed  at  Allied  Signals  in  recent  years.®  '* 

CHARACTERIZATION 

Professor  L.  Monnerie  and  co-workers  (Ecole 
Sup6reure  de  Physique  et  de  Chimie  Industrelles  de 
Paris,  Paris)  described  "NMR  (nuclear  magnetic 
resonance)  Investigations  of  Interphases  in  Elasto¬ 
mer  Blends."  He  pointed  out  that  the  interphase 
was  important  mechanically.  He  added  that,  in 
addition  to  the  conventional  techniques  for  charac¬ 
terization,  X-ray,  differential  scanning  calorimetry 
(DSC),  and  viscoelasticity,  carbon  13  nuclear 
magnetic  resonance  spectroscopy  ('®CNMR)  can  be 
used  to  discriminate  between  phases  and  the  effects 
of  block  molecular  weight  and  heat  treatments  on 
physical  behavior.  Phases  (composed  of  chemical 
units  in  the  polymer)  must  have  different  relaxation 
(diffusion)  times  (designated  as  the  well  known  T], 
T2  or  Tip  types  of  molecular  motions)  for  the 
analysis.  Molecular  compatibility,  incompatibility 
or  partial  miscibility  has  been  observed  depending 
on  temperature  and  blend  contbrmations.  Systems 
such  as 

•  poly(methylmethacylate)  - 
poly(butylacrylate)  and 

•  cis  1,4  or  poly(isopre!ne)-poly(butadiene)  of 
the  1,4  or  1,2  types 
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have  been  investigated  extensively  with  changes  in 
composition  for  evaluation. 

Recently,  ai^  at  this  meeting  also,  "Supercrit¬ 
ical  Fluid  Fractionation  of  Polymers  and  Blends" 
have  featured  in  the  preparation  and  characteriza¬ 
tion  of  gram  size  samples  of  narrow  molecular 
weight  distribution  fractions  for  substantial  re¬ 
search  and  development  work.  Professor  Mark 
McHugh  (Department  of  Chemical  Engineering, 
Johns  Hopkins  University,  U.S.)  provided  an 
informative  talk  on  this  topic.  He  pointed  out  the 
underlying  principles  behind  the  procedure,  using  a 
complicated  copolymer  of  composition  poly(ethyl- 
co-methyl  acrylate)  (87.8  mol.  percent  ethylene/ 
12.2  mol.  percent  methylacrylate)  and  two  differ¬ 
ent  solvents,  (a)  propane  and  (b)  chlorodifluoro- 
methane,  respectively.  He  then  demonstrated  the 
efficacy  of  the  meth^.  Interestingly,  fractionation 
may  be  based  only  on  molecular  weight  and  not  on 
polymer  composition  using  solvent  (a),  which  is  a 
very  good  supercritical  solvent  for  this  copolymer. 

Di  W.M.  Bunge  (DSM  Research,  Geleen, 

The  Netherlands)  described  the  molecular  charac¬ 
terization  of  linear  low  density  polyethylene 
(LLDPE)  by  using  the  rising  temperature  elution 
fractionation  method,  which  is  very  suitable  for 
elucidating  the  complex  chain  microstructure  of 
these  heterogeneous  polymers.  Sample  molecular 
weight  distributions  were  provided.  In  a  similar 
Vein,  Prof.  G.  Groeninck  et  al.  made  a  detailed 
thermal  and  morphological  study  of  1-octene 
LLDPE  fractions.  He  pointed  out  (like  Bunge)  the 
broad  molecular  weight  and  melting  characteristics 
of  this  polymer  The  polymer  fractions  investigat¬ 
ed  did  exhibit  narrow  melting  isotherms  that  shift¬ 
ed  to  lower  temperatures.  The  samples  also  de¬ 
creased  in  size,  mass,  and  crystallinity  with  in¬ 
creasing  side  branching  content  Morphologically, 
this  trend  also  resulted  in  a  lov/er  spherulitic 
growth  rate  and  smaller  sphemlites  obtained  with 
increasing  side  branching  content  in  the  polymer 
fractions.  This  is  an  important  factor  in  determin¬ 
ing  polymer  properties. 

For  laboratory  and  bench  scale  preparations. 
Dr.  Ludo  Klcinjens  (DSM  Research,  Geleen,  The 
Netherlands)  gave  a  tal,k  on  the  production  of 
"High  Purity  Polymers,"  using  near  or  supercritical 
thermodynamic  conditions  to  remove  impurities 
from  the  polyiner  product  (the  same  technique  is 
used  commercially  for  decaffeinating  coffee,  i'he 


efficiercy  of  the  separations  is  based  upon  the  high 
diffusion  of  small  molecules  under  near- 
supercritical  conditions.  Some  results  were  pre¬ 
sented  for  acrylonitrile  containing  co-polymer 
blends  and  homopolymers  such  as  ethylene-propyl- 
ene-diene  terpolymers,  demonstrating  the  feasibility 
of  the  technique  for  bench  experimentation. 
Kleinjens  reported  that  the  rapid  expansion  of  a 
dilute  compressed  polymer  solution  through  an 
orifice  was  an  effective  way  to  improve  polymer 
purity  of  the  "flashed-off"  material.  In  the  writer’s 
opinion,  these  procedures  may  be  considered  ad¬ 
vantageous  for  the  preparation  of  qualit>'  polymers 
for  scientific  investigations  and  applications.  As 
yet.  they  have  not  been  sufficiently  exploited  for 
this  purpose. 

MISOPHASES 

The  author  of  this  article  [Dr.  J.H.  Magill], 
presented  a  lecture  on  "Polyphosphazene  Meso- 
phase;  Structure  Morphology  and  Properties."  I 
briefly  outlined  synthetic  methods  for  making 
polyphosphazenes.  I  also  emphasized  tne  need  for 
a  more  commercially  viable  polymerization  process 
than  is  presently  available  for  polyphosphazenes, 
notwithstanding  the  wide  range  of  potential  applica¬ 
tions  of  these  polymers.  On  the  structure  and 
morphology  side,  interest  was  focused  on  the 
influence  of  the  side  group  size  on  the  thermotrop¬ 
ic  behavior  of  poly(o:tphosphazenes),  particularly 
w:tli  alkoxy  or  aryloxy  substituents.  Large 
substituents  were  found  to  confer  some  degree  cf 
immobility  that  not  only  reduced  crystallizability, 
but  also  influenced  side  chain  mobility,  giving  rise 
to  substantial  disorder  at  and  above  the  poi^nner 
T(l)  temperature,  which  sets  an  upper  limit  for 
engineering  applications  of  these  polymers.  Mor¬ 
phological  investigations  made  on 
po!y(oxyphosphazenes)  that  have  been  heat-treated 
through  T(l),  mesophase  transition  temperature,  is 
consistent  with  a  chain  extended  columnar  mor¬ 
phology.  Side  chain  crystallization  occurs  when¬ 
ever  the  mesophase  cooled  below  T(l).  These  two 
models  of  ordering  are  in  accordance  with  a  hexati- 
cally  packed  thermotropic  phase  associated  with 
variously  mobile  side  groups.  The  order  in  the 
mesophase  is  short  range.  Interestingly,  when  side 
groups  are  bonded  directly  to  the  phosphorous, 
mesophase  formation  still  occurs,  and  is  due 
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primarily  to  the  presence  of  an  inherently  flexible  - 
P=N-  polymer  backbone.  Poly(phosph^nes) 
appear  to  be  more  thermally  stable  than 
poly(oxyphosphazenes),  a  fact  of  practical  impor¬ 
tance  in  mechanical  and  other  applications. 

Professor  Martin  Mdller  (Chemische 
Technologie,  University  of  Twente,  Enschede,  The 
Netherlands)  gave  an  interesting  lecture  on  well 
characterized  poly(di-n-alkylsiloxanes)  and  poly(di- 
n-alkylsilanes)  that  form  mesophases  that  depict 
columnar  flexible  backbone  mesophases  somewhat 
similar  to  the  extended  chain  two-dimensional 
polyphosphazenes  reported  by  the  writer: 

R  R 

I  I 

-[Si-o]x-  -[-Si-]x 

1  I 

R  R 

poly(di-n-alkylsiioxanes)  poly(di-alkylsilane) 

Fig.  1  —Columnar  flexible  backbone  mesophases 

Mbller’s  research  group  made  careful  mechanical 

creep/recovery,  thermal,  and  and  ^’Si  NMR 

characterizations  of  anionically  prepared  materials 
with  various  kinds  of  side  groups  that  influence 
melting  and  mesophase  and  glass  transition  tenq)er- 
atures.  These  factors  are  important  and  form  the 
basis  for  designing  macromolecular  materials  with 
special  phase  behavior.  Through  anionic  ring 
opening  polymerization,  molecular  mass  can  be 
controlled  in  homo  and  block  copolymers. 


Other  more  conventional  liquid  crystalline 
polymers  were  reported  on  by  Dr.  J.A.J.  Jansen  et 
al.  (Philips  Conference  Centre  Plastics  B.V.,  Eind¬ 
hoven,  Tlie  Netherlands).  The  lecture  title  was 
"Characterization  of  Molecular  Orientation  Profiles 
in  Liquid  Crystalline  Polymeric  Products  by 
FTIR,"  and  investigations  focused  on  development 
and  use  of  the  new,  low-cost  Fourier  transform 
reflectance  microscope  for  rapid  analysis  of  inject¬ 
ed  molded  polymers  obtained  as  fractions  from 
chromatographic  and  extraction  experiments. 
Specimen  spot  size  was  0.3  mm,  and  detection 
limits  were  in  the  nanogram  range.  From  dichroic 
measurements  performed  in  several  directions,  the 
molecular  orientation  of  the  sample  and  its  order 
parameter  were  determined.  Ultra-milling  of  spec¬ 
imens  were  used  to  remove  thin  layers,  which  may 
be  equivalent  to  a  depth-profiling  technique.  This 
kind  of  procedure  with  commercial  polymers  facili¬ 
tates  their  morphological  comparison  for  evaluation 
of  end-use  performance. 

An  interesting  presentation  on  aromatic  A-B 
polyesters  was  given  by  Professor  Walter  Heitz 
[Philipps  University,  Marburg,  Federal  Republic  of 
Germany  (FRG)]  to  produce  polymers  with  ther¬ 
motropic  properties.  Potential  optical  properties 
were  not  addressed.  A  wide  variety  of  soluble  and 
fusible  derivatives  of  poly(l,4  -phenylene  vinylene) 
(PPV)  were  also  made  by  the  Pd-catalysed 
arylation  of  ethylene  with  dihalogenarenes  through 
structural  modiheations  of  PPV.  Soluble  and  fus¬ 
ible  rod-like  derivatives  of  PPV  with  potential  as 
devices  in  materials  technology  are  also  featured  in 
this  work  according  to  the  scheme: 


Hal— Ar~Hal  +  HjC^CHj 
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MORPHOLOGY 

Morphological  diaracterization  of  "Polysaccha¬ 
ride  Crystals"  was  meticulously  performed  and 
described  by  Prof.  H.  Chan:^  [National  Center  for 
Scientific  Research  (CNRS),  France].  Crystals 
ranging  from  several  microns  down  to  a  few  tenths 
of  a  luuiometer  were  investigated  by  meticulous 
electron  crystallogri^hy.  Linear  polysaccharides 
that  are  found  in  plants,  organelles,  or  precipitated 
from  solution  were  characterized  to  yield  well- 
defined  diffraction  patterns  that  were  used  to  re¬ 
solve  molecular  and  crystal  structure  from  amy- 
lose,  xylan,  mannan,  and  nigeran  crystals.  Poly- 
sacdiaride  materials  are  important  in  terms  of  their 
biological  function,  phase  stability,  and  natural 
origin. 
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Fig.  4— ABA  copoiymers 


Aspects  of  polyethylene  extrusion  were  ad¬ 
dressed  by  Professor  Andrew  Keller  (H.H.  Wills 
Physics,  Laboratory,  Bristol,  U.K.).  He  pointed 
out  that  unusual  melt  flow  irregularities  are  ob¬ 
served  in  a  narrow  processing  ten:q)erature  window 
on  either  side  of  15  rc  where  extrusion  rate  ex¬ 
hibits  a  sharp  maximum.  This  "transition-like" 
bdiavior  was  speculated  to  be  associated  with  chain 
extension  via  elongational  flow,  conesponding  to 
an  induced  phase  transformation  in  high  molecular 
weight  poly(ethylenes).  The  proposed  phase  transi¬ 
tion  may  be  characterized  by  an  "extensional  flow- 
stretch  transition"  producing  a  mobile  hexagonal 
phase  in  poly(ethylene)  with  a  potentially  filamen¬ 
tous  morphology.  However,  the  morphological 
features  mentioned  here  must  still  be  documented 
experimentally. 

SYNTHESIS  AND  POLYMERIZATION 
PROCESSES 

There  were  several  papers  and  posters  in  this 
category.  Some  are  reported  below.  Professor  H. 
Hdcker  (RWTH,  Aachra,  FRO)  discussed  "Poly¬ 
mers  and  Cqmlymers  from  Cyclic  ‘Monomers'" 
sriierein  cydic  lactones  (such  as  caprolactone  and 
pivalactone)  were  ring  qwned  with  alcoholates  and 
other  imitators  to  produce  AB-cqpolymers  accord¬ 
ing  to  the  sdwme: 


These  polymers  were  characterized  and  found  to  be 
typically  M,^  -  60K  and  Mn  ~  40K. 

MODELING 

Professor  C.  Kiparissades  (Department  Chemi¬ 
cal  Engineering,  University  of  Thessaloniki, 
Greece)  mathematically  modeled  diffusion-control- 
led  free  radical  polymerization  reactions.  A  gener¬ 
al  mathematical  framework,  using  parameters  with 
a  physical  meaning,  was  presented  and  found  to  be 
in  agreement  with  experimental  data  based  on 
monomer  conversion,  total  radical  indentation,  and 
mean  molecular  weight  obtained  from  different 
laboratories.  The  model  concentrated  on  describ¬ 
ing  and  predicting  the  free  radical  polymerization 
kinetics  of  styrene  and  methyl  metlucrylate, 
azobisisobutyronitrile,  and  other  related  thermal 
initiators. 

A  new  synthetic  t^roach  to  poiy(p-xylene) 
(PPX)  was  claimed  in  the  program  by  Prof.  Peter 
Simon  (Philipps  University  Marburg,  Germany), 
but  the  lecture  was  not  presented.  Apparently  the 
synthesis  was  based  on  the  pyrolysis  of  dioxidic 
and  diacylesters  of  y,  y'_  dihydroxy  -  p  -  xylene  at 
temperatures  between  5C^  and  9(X)''C  to  pi^uce 
high  molecular  weight  PPX  as  five-standing  films 
that  are  colorless,  stable,  and  slightly  hazy. 
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Dr.  Tom  Loontjens  and  associates  (DSM 
Research,  The  Netherlands)  described  the  prepara¬ 
tion  of  block  copolymers  of  poly(vinylether)  and 
poly(ethylene  glycol)  using  an  anhydrous  organic 
base,  triethylamine  (two  equivalents)  with  respect 
to  initiator  and  also  two  equivalents  of  methanol. 
Claims  were  made  that  clean  polymers  are  pro¬ 
duced  from  vinyl  ether  and  eUiylene  glycol  mono¬ 
mers  by  means  of  living  cationic  polymerizations. 

Considerable  interest  was  shown  in  the  com¬ 
mercial  plastic,  linear  low  density  polyethylene 
(LLDPE).  Two  characterization  and  one  modeling 
paper  were  presented  on  this  polymer.  Leo  Smit 
(DSM  Research,  Geleen,  The  Netherlands)  gave  an 
nice  reactor  modeling  lecture  on  "The  Use  of 
Micromixing  Calculations  in  LLDPE." 

CONCLUSION 

This  conference.  Optimizing  Production,  Pro¬ 
cessing  and  Properties  of  Polymers,  covered  topi¬ 
cal  and  important  aspects  of  polymer  research  Aat 
have  particular  academic  and  technological  value 
for  the  U.S.  Navy.  Novel  papers  on  liquid  crystal¬ 
line  polymers,  composites,  synthesis,  characteriza¬ 
tion,  and  modeling  were  presented.  This  report 
has  described  only  the  highlights. 
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THE  CONFERENCE 

A  conference  to  recognize  the  20th  anniversary 
of  the  establishmfsnt  of  the  Corrosion  and  Protec¬ 
tion  Centre  at  the  University  of  Manchester  Insti¬ 
tute  of  Science  and  Technology  (UMIST)  was  held 
28  June  -  3  July,  1992  at  UMIST,  Manchester, 
U.K.  More  than  420  delegates  attended  from  44 
countries,  with  39  from  the  U.S.  The  proceedings 
will  be  published  by  Pergamon  as  a  special  volume 
of  the  Corrosion  Science  JoumaL 

The  purpose  of  the  conference  was  to  provide 
a  high-level  overview  of  recent  advances  in  corro¬ 
sion  science  engineering  and  technology.  During 
the  5-day  period,  452  papers  were  presented  in  5 
parallel  oral  sessions  and  3  poster  sessions.  In 
corrosion  science  much  interest  was  generated  by 
the  application  of  chaos  theory  to  corrosion  and  in 
corrosion  engineering  by  the  prediction  of  perfor¬ 
mance  and  corrosion  lifetimes  in  service. 


APPLICATION  OF  CHAOS  THEORY 
TO  CORROSION 

The  application  of  chaos  theory  to  corrosion 
was  treated  in  papers  by  J.  Stringer  (Electric  Pow¬ 
er  Research  Institute  (EPRI),  U.S.),  K.  Sieradzki 
(Johns  Hoplcins  University,  U.S.),  S.M.  Sharland 
(Atomic  Energy  Authority  (AEA),  Harwell,  U.K.), 
and  J.  Uruchurtu-Chaudrin  (Institute  de  Investigac- 
iones  Electricas  (HE),  Mexico).  Deterministic 
chaos  is  known  to  oa*  :r  in  a  variety  of  physical 
and  chemical  processes,  including  corrosion.  A 
brief  description  is  as  tbilows:  If  the  state  of  the 
system  is  fully  described  by  a  set  of  /t  independent 
dynamical  variables,  then  its  state  at  a  given  time  t 
is  specified  by  a  point  in  an  n-dimensional  state 


space,  and  subsequent  changes  cause  that  point  to 
move  along  a  path  in  the  state  space  that  is  called  a 
trajectory.  If  the  system  is  initially  perturbed  from 
equilibrium,  it  will  subsequently  evolve  toward  an 
equilibrium  condition,  and  the  locus  of  the  possible 
equilibria  in  state  space  is  called  the  attractor;  the 
trajectory  thqs  moves  toward  the  attractor. 

In  ordinary  (non-chaotic)  circumstances,  the 
evolution  of  the  trajectory  is  essentially  the  same 
for  situations  where  two  initial  states  of  the  system 
are  close  together.  In  fact,  the  trajectories  con¬ 
verge  as  they  approach  the  attractor.  However,  if 
the  system  is  chaotic,  the  paths  will  diverge  expo¬ 
nentially.  The  rate  of  convergence  or  divergence 
is  characterized  by  a  set  of  exponents  called  the 
Ly^unov  exponents,  and  a  condition  for  chaos  is 
that  at  least  one  of  the  Lyapunov  exponents  must 
be  positive. 

For  a  chaotic  system,  the  attractor  is  a  compli¬ 
cated  surface  that  generally  has  fractal  dimensions; 
it  is  called  a  strange  attractor.  Because  of  the 
sensitivity  to  the  initial  conditions,  the  detailed 
behavior  of  chaotic  systems  is  unpredictable,  at 
least  over  long  times.  Systems  that  evolve  in  a 
random  manner  are  a  third  possibility.  They  do 
not  possess  an  attractor,  although  their  evolution  in 
state  space  may  be  constrained;  these  are  called 
stochastic  systems.  The  chaotic  systems  that  were 
examined  by  the  authors  were  corrosion  of  silver 
in  perchloric  acid  (Sieradski),  pitting  of  stainless 
steels  in  chloride  solutions  (Sharland),  and  pitting 
of  aluminum  in  chloride  solutiokis  (Uruchurtu- 
Chaudrin).  Theoretical  treatments  in  the  past  have 
regarded  pitting  as  a  stochastic  process  and  the 
demonstration  that  in  many  instances  it  is  in  fact  a 
chaotic  process  will  impact  the  further  development 
of  basic  theory  and  predictability  approa^'hes  to 
pitting. 
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PREDICTION  OF  PERFORMANCE  AND 
CORROSION  LIFETIMES 

The  prediction  of  performance  and  corrosion 
lifetimes  in  service  was  treated  in  papers  by  J.A. 
Richardson  (International  Computers  Limited 
(ICL),  U.K.)  R.W.  Staehle  (University  of  Minne¬ 
sota.  U.S.),  P.A.  Laycock  (UMIST,  U.K.),  O. 
Forsen  (Helsinki  University,  Finland),  R.T.  White 
(Consultant,  S.A.),  and  J.L.  Dawson  (CAPCIS 
Ltd.,  U.K.).  Approaches  included  corrosion-based 
design  (Staehle),  generalized  extreme  value  statisti¬ 
cal  techniques  for  localized  corrosion  (Laycock), 
computer-based  databases  and  expert  systems 
(Richardson),  as  well  as  a  wide  variety  of  other 
statistical  and  experimental  techniques  (Forsen, 
White,  Dawson).  Several  of  these  approaches 
were  based  on  monitoring  of  corrosion  behavior  or 
analyzing  failures  or  failure  databases,  thereby 
relying  on  extrapolation  or  interpolation  of  existing 
data.  The  ability  to  actually  predict  the  service  life 
of  new  technological  equipment  used  in  new  pro¬ 
cesses  remains  a  challenge  that  has  to  be  met  by 
improved  fundamental  understanding  of  corrosion 
processes.  In  this  sense  true  prediction  still  seems 
a  long  way  off. 

Papers  of  particular  interest  to  the  U.S.  Navy 
were  presented  in  sessions  dealing  with  the  break¬ 
down  of  passivity  by  pitting  and  crevice  corrosion, 
cathodic  protection,  biocorrosion,  and  high-temper¬ 
ature  coatings  for  gas  turbines.  In  the  passivity 
breakdown  area,  ONR-supported  work  was  pre¬ 
sented  by  E.  McCafferty  of  the  Naval  Research 
Laboratory  (NRL)  Washington,  D.C.,  who  de¬ 
scribed  improvements  in  the  pitting  resistance  of 
aluminum  by  ion  implantation  and  the  pH  of  zero 
charge  model  for  wit  initiation,  and  by  H.W.  Pick¬ 
ering  (Pennsylvania  State  University,  U.S.)  who 
described  the  development  of  a  microscopy/local 
probe  method,  using  transparent  polyethylene 
crevice  formers,  which  enables  in-situ  examination 
of  the  crevice  corrosion  process. 

The  output  of  crevice  corrosion  research  is  of 
particular  interest  in  failure  prevention,  either  for 
identifying  resistant  materials  or  for  design  changes 
aimed  at  eliminating  crevice.  In  the  cathodic 
protection  area,  K.E.  Lucas  (NRL),  described  the 


Naval  Sea  Systems  Command  (NAVSEA)  support¬ 
ed  development  of  physical  scale  modeling  tech¬ 
niques  for  designing  impressed-current  cathodic- 
protection  systems  for  ship  hulls.  An  innovative 
feature  of  this  approach  was  the  use  of  scaled 
electrolyte— a  concept  that  was  well  received  by  the 
expert  audience.  Physical  scale  modeling  is  being 
used  by  the  U.S.  Navy  to  identify  more  effective 
distributions  and  placements  of  impressed  current 
anodes  on  ship  hulls. 

BIOCORROSION 

Biocorrosion  topics  were  well  covered  by  four 
oral  papers  and  four  poster  papers.  Of  particular 
interest  were  the  findings  of  V.K.  Gouda  (National 
Research  Centre,  Egypt)  of  biocorrosion  of  904L 
stainless  steel  and  Monel  alloy  400  by  sulfate- 
reducing  bacteria  in  seawater.  The  paper  empha¬ 
sized  the  importance  of  the  correct  level  of  biocidal 
chlorination  and  represents  one  of  the  few  papers 
in  the  technical  literature  describing  carefully 
documented  studies  of  biocorrosion  in  seawater. 
High-temperature  coatings  for  gas  turbines  were 
covered  by  five  oral  presentations  and  two  poster 
papers.  Of  particular  interest  was  the  ONR-sup- 
ported  development  of  the  CR  +  AL  and  CR  +  Si 
co-deposition  techniques  in  pack  cementation  pre¬ 
sented  by  R.A.  Rapp  (Ohio  State  University, 

U.S.),  the  borosilicate  crack  sealants  for  cracked 
silicon  carbide  coatings  on  carbon-carbon  compos¬ 
ites  developed  by  F.J.  Buchanan  (Cambridge  Uni¬ 
versity,  U.K.),  and  the  strong  plea  for  more  funda¬ 
mental  research  in  the  area  of  thermal-mechanical 
fatigue  by  R.V.  Hillery  (General  Electric,  Cincin¬ 
nati,  U.S,).  The  Navy  interest  here  is  in  marine 
gas  turbines  and  in  aircraft  engines  of  the  future 
with  high  thrust-to- weight  ratios. 

CONCLUSIONS 

The  overall  impression  left  by  the  meeting  was 
that  corrosion  research  and  engineering  continue  to 
be  ongoing  dynamic  disciplines,  clearly  capable  of 
absorbing  new  supportive  concepts  developed 
within  other  disciplines  such  as  mathematics  (chaos 
theory)  and  microbiology  (biofilms). 
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International  Arctic  Buoy  Program  Meeting  in  Oslo 

6/  John  P,  Dugan,  an  oceanographer  currently  serving  as  Liaison  Scientist  for 
Physical  Oceanography  at  the  Office  of  Naval  Research  Buropean  Office.  Previously 
he  formed  and  directed  the  Field  Measurements  Department  for  Arete  Associates. 

Earlier,  he  was  at  the  Nava!  Research  Laboratory,  Washington,  D.C. 
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INTRODUCTION 

The  International  Arctic  Buoy  Program  (lABP) 
is  a  collaboration  between  a  half  dozen  countries 
that  are  interested  in  deploying  and  maintaining  an 
array  of  drifting  meteorological  buoys  in  the  Arc¬ 
tic.  This  is  an  important  activity,  as  the  buoys  are 
cruciai  to  the  accuracy  of  weather  forecasts  in  the 
northern  hemisphere  by  U.S.  Navy  and  the  nation¬ 
al  weather  centers  in  the  U.S.,  Canada,  the  United 
Kingdom  (U.K.),  and  Norway.  There  previously 
was  no  activity  to  coordinate  the  several  separate 
national  buoy  programs,  ncr  one  to  coordinate  the 
interests  of  several  agencies  within  the  U.S.  The 
lABP  is  relatively  new,  and  its  second  annual 
meeting  was  held  2-4  June  1992  at  the  Norsk 
Polarinstitutt  in  Oslo,  Norway. 


BASIC  OPERATING  PRINCIPLES 

The  objective  of  the  Program  is  to  establish 
and  maintain  a  network  of  drifting  buoys  and  to 
obtain  data  for  realtime  operational  requirements 
and  research  purposes  in  meteorology  and  ocean¬ 
ography.  This  includes  support  for  Ae  World 
Climate  Research  Program  (WCRP)  and  the  World 
Weather  Watch  (WWW)  Program;  it  will  build  on 
cooperation  among  these  agencies  and  institutions 
having  Arctic  interests. 

The  Program  will  maintain  an  observational 
network  via  the  buoys  and  will  distribute  basic 
meteorological  data  in  realtime  over  the  Global 
Telecommunication  System  (GTS)  for  use  by  the 
many  national  weather  centers  in  their  forecasts. 


The  Program  will  also  ensure  that  the  data  are 
archived  and  members  will  liaise  and  cooperate 
with  other  Arctic  buoy  operators.  The  operational 
area  of  the  Program  includes  the  central  Arctic 
basin  and  its  marginal  seas,  excluding  the  econom¬ 
ic  zones  except  where  agreement  of  the  Coastal 
State  has  been  obtained. 

Basic  meteorological  data  include  the  buoy 
position  and  the  surface  air  pressure  and  tempera¬ 
ture.  Additional  variables  of  interest,  but  not  part 
of  the  necessary  minimum,  are  wind  speed  and 
direction,  snow,  sea  ice  properties,  and  subsurface 
oceanographic  variables.  The  desired  spacing  of 
buoys  is  400  km,  as  required  by  the  WCRP  and 
WWW. 

Participation  in  the  Program  is  open  to  opera¬ 
tional  agencies,  meteorological  and  oceanographic 
institutes,  research  agencies,  and  nongovernmental 
organizations  with  interests  in  the  Arctic.  Present 
participants  include  the  U.S.,  Canada,  Norway, 
U.K.,  Germany,  and  Russia.  The  U.S.  representa¬ 
tive  is  David  Benner  of  the  Naval  Polar  Oceanog¬ 
raphy  Center  in  Suitlard,  Maryland;  he  represents 
a  number  of  U.S.  Navy  organizations  (the  Office 
of  Naval  Research,  Naval  Oceanographic  Office, 
and  Commander,  Naval  Oceanography  Command) 
and  the  National  Oceanographic  and  Atmospheric 
Administration  (NOAA),  the  other  interested  U.S. 
agency.  The  Chairman  of  the  lABP  is  Brian 
O’Donnell  of  the  Atmospheric  Environment  Ser¬ 
vice  in  Edmonton,  Alberta,  Canada.  Coordin  ation 
between  the  different  national  programs  is  main¬ 
tained  by  the  lABP  Coordinator,  who  presently  is 
Roger  Colony  of  the  Polar  Science  Center  of  the 
University  of  Washington. 
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ARCTIC  BUOYS 

The  interest  in  drifting  buoys  in  the  Arctic  is 
an  extension  of  drifting  buoy  programs  in  more 
temperate  oceans.  The  buoys  typically  send  the 
measured  surface  data  to  a  satellite  system;  in  the 
case  of  freely  drifting  buoys,  they  send  it  through 
the  Argos  satellite  system.  This  system  calculates 
the  position  of  the  buoy;  in  addition,  it  passes  the 
data  on  to  the  users.  The  primary  surface  data  for 
the  Arctic  drifters  is  pressure  because  this  is  the 
first  priority  variable  required  by  the  forecasting 
centers  to  improve  their  northern  predictions. 

Surface  winds  would  be  a  useftil  addition,  but 
they  are  not  necessary  because  the  speed  and  direc¬ 
tion  on  the  synoptic  scale  are  reasonably  accurately 
calculated  from  the  pressure  gradient  analyzed 
from  the  whole  grid  of  buoys.  Surface  tempera¬ 
ture  has  high  interest  because  of  climatic  interests 
and  research  on  thermodynamics  of  sea  ice,  but  the 
measured  ten^erature  is  considered  unreliable  in 
present  designs.  This  apparently  is  due  to  solar 
heating  of  the  instrument  housing  in  the  polar 
summer  and  tlie  insulation  of  overlying  snow  cover 
during  the  polar  winter. 

The  oceanographic  uses  of  these  buoys  are 
many.  The  buoy  positions  are  an  important  source 
of  data  on  drift  of  the  ice  cap.  Besides  the  air 
pressure  and  temperature  data,  the  buoys  also  are 
convenient  capsules  to  mount  sensors  for  oceano¬ 
graphic  data  collection.  Thermistor-conductivity 
sensor  strings,  current  meters,  and  hydrophones 
are  all  interesting  devices  for  research,  and  they  all 
have  been  u.sed  in  prototype  buoys. 

RESULTS  TO  DATE 

The  lABP  has  been  a  real  success  in  its  short 
1‘fe;  cooperation  lias  been  exceptionally  good.  In 
June,  1992,  about  SO  buoys  were  in  operation  in 
the  Arctic;  almost  IS  more  were  dq)loyed  during 
July-August  1992.  There  has  been  a  very  good 
return  of  data,  with  about  85  percent  of  tium 
reporting  on  the  GTS.  The  positions  are  moni¬ 
tored  by  the  coordimuor  and  are  published  in  a 
monthly  report  that  the  Polar  Science  Center  dis¬ 
tributes  to  interested  parties.  Plans  are  being  made 
to  fill  obvious  gaps  in  the  coverage  with  new 
buoys. 


There  presently  is  a  gap  on  the  Russian  side  of 
the  North  Pole  due  to  the  failure  of  several  buoys 
that  the  Arctic  and  Antarctic  Research  Institute 
(AARI)  had  previously  deployed.  Preliminary 
plans  are  being  made  to  fill  this  gap  in  the  near 
future.  Dr.  Nikiforov,  the  Scientific  Director  of 
AARI,  told  the  assembled  representatives  that  they 
were  having  trouble  with  their  transmitters,  but 
more  information  was  not  available  at  the  meeting. 

Sergey  Vasiljev  of  the  Russian  Committee  for 
Hydrometeorology  and  Monitoring  of  the  Environ¬ 
ment  stated  that  the  lABP  continues  to  be  an  im¬ 
portant  program  for  Russia,  and  they  were  endeav¬ 
oring  to  continue  their  support  for  this  program  in 
the  face  of  their  severe  financial  situation.  This  is 
consistent  with  other  opinions  I  obtained  during  a 
subsequent  visit  to  AARI,  as  this  plays  a  support¬ 
ive  role  in  the  Institute’s  forecasting  for  the  safety 
of  shipping  on  the  Northern  Sea  Route. 

Dr.  Victor  Savtchenko  who  spent  20  years  at 
AARI  but  now  is  with  the  World  Meteorological 
Office  (WMO)  in  Geneva  takes  his  WMO  position 
seriously.  He  was  pushing  AARI  to  fulfill  its 
commitment  to  provide  services  to  international 
programs  such  as  WOCE  and  lABP  as  per  prior 
agreements.  He  voiced  a  strong  objection  to  Dr. 
Nikiforov  regarding  AARI’s  reluctance  to  provide 
adequate  infonnation  about  the  status  and  accuracy 
of  their  instrument  developments.  This  came  up 
during  presentations  in  which  it  was  apparent  that 
AARI  is  having  trouble  developing  and  deploying 
drifting  buoys  that  work  as  designed. 

Dr.  Savtchenko  was  helpful  to  me  in  attempt¬ 
ing  to  find  out  just  how  they  do  their  positioning  of 
the  buoys.  It  seems  that  they  use  the  Doppler  shift 
of  the  transmitter  and  the  known  orbit  of  the  satel¬ 
lite  to  calculate  the  position,  just  as  is  done  in  Che 
West  with  the  Argos  system.  They  say  they  need 
an  additional  buoy  in  a  fixed  position  on  an  island 
to  obtain  good  fixes,  but  I  did  not  (still  don’t) 
understand  why  this  is  so.  There  tqiparently  is  an 
offset  in  the  calculated  position  that  they  correct  by 
using  the  known  position  of  the  additional  buoy.  I 
couldn’t  find  out  whether  this  is  due  to  unknown 
drift  of  the  transmitter  frequency,  or  unknown 
position  of  the  satellite,  or  some  other  uncertainty. 
Nor  was  I  any  more  successful  in  finding  an  an¬ 
swer  to  this  question  during  my  subsequent  visit  to 
AARI. 
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INTRODUCTION 

Riysical  processes  at  and  near  the  ocean  sur¬ 
face  are  ftindamentally  dq>endent  on  sub-mesoscale 
interactions  because  of  the  intrinsically  small-scale 
itiKure  of  the  interface,  as  exemplified  by  mrbulent 
energy  in  those  ocean  surface  gravity  waves  and 
surface-generated  turbulence.  The  scales  of  motion 
extend  from  those  on  whidi  energy  is  dissipated 
(millim^r  to  centimeter)  to  scales  associated  with 
the  dq>th  of  the  mixed  layers,  namely  the  lower 
Idlometer  In  the  atmosphere  and  the  upper  1(X) 
meters  in  the  ocean.  These  scales  typically  are 
smaller  than  those  associated  with  tlw  "synqptic" 
motitHis  in  eithor  the  atmoq^Mne  or  die  ocean.  An 
undersmnding  of  the  momentum,  heat,  and  mass 
fluxes  requires  siimtluneous  comideration  of  atmo- 
8|dmric,  ooeanto,  and  surftM  processes  u  a  cmi- 
pled  system.  This  imemting  and  difficult  uea  has 
been  the  subje^  of  increasing  levels  of  researdi. 

Because  of  this  interest,  "Sub-Mesoscaie 
Air-SM  btteractions*  was  diosen  as  the  topic  of  the 
24th  Li^e  Colloquium  on  Ocean  Hydrodynamics. 


The  colloquium  was  held  4-8  May  1992  at  the 
University  of  Liege  and  was  hosted  by  Profs. 
Jacques  Nihoul  and  Salim  Djenidi.  The  scientific 
committee,  co-chaired  by  Prof.  Nihoul  and  Dr. 
Alan  Brandt,  organized  a  broadly  based  confer¬ 
ence;  contributions  included  papers  on  the  atmo¬ 
spheric  boundary  layer,  the  ocean  nutted  layer,  and 
the  nature  of  the  interface  itself.  The  studies  in¬ 
cluded  all  major  oceans. 

The  Li  ige  colloquia  are  noted  for  their  broad 
intemtfional  attendai^.  Particularly  noteworthy 
was  the  large  participation  by  scientists  fh>m  the 
former  Soviet  Union-more  than  10  percent  of  the 
total  of  iiKire  foan  80  scientists.  Also  noteworthy 
was  the  preseiuation  of  the  Doctor  Honoris  Causa 
of  the  University  of  Liege  to  Prof.  Eric  Kraus  at 
tlM  colloquium  banquM.  Professor  Kraus  was 
honored  for  hU  leadership  and  lifiaime  research 
contributions  to  the  air-sea  interaction  uid  oceuio- 
graphic  disciplines.  Professor  Kraus  presented  the 
initial  and  concluding  colloquium  sminars:  "What 
We  Do  Not  Know  About  Sub-Mesoscale  Air-Sea 
Interactions  "  and  "What  We  Do  Know  About 
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Sub-Mesoscale  Air-Sea  Interactions."  To  suggest 
the  flavor  of  the  colloquium,  we  summarize  some 
of  the  presentations  we  felt  most  interesting,  fol¬ 
lowed  by  a  summary  of  Prof.  Kraus’  final  presen¬ 
tation. 

SELECTED  SUMMARIES 

To  set  the  stage  for  the  ensuing  colloquium, 
Alan  Brandt  of  the  Office  of  Naval  Research 
(ONR)  presented  an  ovendew  of  the  scope  of 
sub-mesoscale  air-sea  interactions.  Characteristics 
of  these  interactions  that  motivate  and  challenge  the 
scientific  communit>'  were  listed  as: 

•  sub-mesoscale  processes  are  fundamental  to 
understanding  air-sea  fluxes; 

•  small-scale  closure  is  critical  to  meso/global 
models; 

•  atmosphere  and  ocean  both  exhibit  intermit¬ 
tent,  nonstationary  behavior; 

•  air-ocean  boundary  layers  have  similar 
structure: 

•  a  fundamental  scale  mismatch  exists  be¬ 
tween  the  atmosphere  (-days,  ~  1000  km) 
and  the  ocean  ( -  month,  100  km);  and 

•  coupled,  air-surface-ocean  processes  must 
be  studied  at  small  scales. 

Brandt  also  presented  high-frequency  tenqiera- 
ture  measurements,  obtained  just  below  the  surface 
(average  depth  of  ~  1 .5  m)  fiom  a  moving  ship. 
These  data  exhibited  intermittent  patterns  represen¬ 
tative  of  coherent  structures  u  horizontal  sc^es  of 
2-5  m.  These  "microstructures"  are  1-2  orders  of 
magnitude  smaller  tlum  previously  seen. 

Following  this,  in  his  lecture  on  "What  We  Do 
Not  Know...",  Prof.  Kraus  posed  three  questions: 

1.  Can  we  specify  air-sea  interfacial  velocity 
and  mean  profiles  above  and  below  the 
inteifue? 

2.  What  is  the  role  of  coherent  structures  and 
how  can  their  vertical  transport  of  imown- 
tum  be  parameterized? 

3.  How  does  die  downward  momentum  flux 
evolve  and  converge? 


Discussions  relevant  to  these  questions  continued 
throughout  the  colloquium. 

Joe  Fernando  of  Arizona  State  University 
presented  laboratory  results  on  fluid  motions  result¬ 
ing  from  the  impingement  of  a  descending  thermal 
on  a  sharp  density  interface.  This  is  relevant  to 
convection  resulting  from  leads  in  the  Arctic  ice 
cover.  It  was  found  that  the  thermal  plume  con- 
vects  surface  water  downward  and  impinges  on  the 
density  interface,  resulting  in  vortices  trapped  at 
the  interface  as  well  as  propagating  interfacial 
waves.  The  complexities  of  Arctic  lead-generated 
surface  layer  mixing  were  well  illustrated,  but  the 
efiects  of  lateral  flow  due  to  movement  of  the  ice 
across  the  water  (resulting  from  wind  stress)  can¬ 
not  be  examined  easily  in  the  laboratory. 

M.  Cure  (Southampton  University)  showed 
very  intriguing  data  from  a  high-frequency  sidescan 
sonar  used  to  detect  subsurface  bubble  clouds 
caused  by  breaking  wind  waves.  These  results, 
obtained  in  Loch  Ness  and  the  North  Sea,  clearly 
showed  persistent  2-50  m  linear  patterns  in  the 
subsurface  distributions  of  bubbles.  The  acoustic 
signals  are  scattered  from  the  bubbles,  which  are 
organized  and  subducted  by  near-surface  conver¬ 
gence  patterns.  These  patterns  appear  to  be  signa¬ 
tures  of  subsurface  coherent  structures,  features 
that  are  commonly  called  Langmuir  cells. 

Iosif  Lozovatsky  (Shirshov  Institute  of  Ocean¬ 
ology)  gave  two  presentations.  In  the  first,  he 
discussed  a  joint  U.S.  and  fonner  U.S.S.R.  experi¬ 
ment  during  which  turbulence  was  measured  in  the 
wake  of  the  Equatorial  Current  from  Baker  Island 
in  the  Pacific  Ocean.  United  States  participants  in 
the  measurements  were  Baker  and  Gibson.  These 
field  measurements  are  a  good  example  of  a  labo¬ 
ratory-type  hydrodynamic  experiment  at  geophysi¬ 
cal  scales,  where  velocity  and  ten^rature  micro¬ 
structure  were  measured.  Preliminary  results  on 
microstructure  and  intermittency  were  presented. 

In  the  second  talk,  Lozovatsky  discussed  observa¬ 
tions  Mid  modeling  of  turbulence  and  finestructure. 
Paruneterizations  for  turbulent  eddy  viscosity  aikl 
diffUsivity  were  developed^  based  on  probabilistic 
modeling  to  simulate  intermittency  and  sheet-layer 
structure  in  the  stratified  upper  ocean. 

Fraidc  Bradley  [Commonwealth  Scientific  and 
Industrial  Research  Organization  (CSIRO),  Austra¬ 
lia]  reported  on  an  experimental  validation  of  the 
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Liu-Katearos-Businger  (LBK)  formula  for  surface 
roughness  using  preliminary  ship  data  from  the 
Tropical  Ocean  Global  Atmosphere-Coupled 
Ocean-Atmosphere  Response  Experiment 
(TOGA-COARE).  Data  included  both  eddy  corre- 
iation  and  dissipation  measures  of  momentum, 
heat,  and  moisture  fluxes.  Bradley  found  that  the 
LKB  formula  compares  very  well  to  measurements 
for  windspeeds  below  7  m/s  but  overestimates  the 
flux  above  this  windspeed. 

Rostislav  Ozmidov  (Shirshov  Institute)  present¬ 
ed  fluxes  of  momentum,  sensible  heat,  and  latent 
heat  measured  by  ship  in  the  Newfoundland  and 
Azores  regions  in  19%  and  1991.  The  sensor 
packages  were  deployed  on  a  forward  boom  that 
provided  height  dependencies  of  flux  cospectra. . 
The  experiments  ajiso  included  subsurface  artiiys  to 
map  upper  ocean  fmestructure  as  well  as  a  down¬ 
ward-looking  elect! c  optical  video  system  to  deter¬ 
mine  gravity  and  capillary  wavenumber  spectra. 
Analysis  of  the  video  data  is  still  ongoing,  and  the 
plan  is  to  integrate  upper-ocean  turbulence,  surface 
fluxes,  and  surface  wavenumber  spectra  into  a 
more  realistic  theoretical  framework. 

An  experimental  erfort  to  relate  the  wind  stress 
direction  to  the  directional  spectrum  of  large  orbit¬ 
al  velocity  waves  was  presented  by  Gary  Geemaeit 
(ONR).  During  near-neutral  stratification  and 
nearly  constant  windspeed  (“4  m/c),  he  found  a 
strong  dependence  of  stress  direction  on  swell 
direction.  These  results  were  suggested  to  be 
important  in  regions  where  swell  are  large,  have 
preferred  directions,  and  are  not  propagating  in  the 
direction  of  the  local  wind,  e.g.,  in  coastal  do¬ 
mains,  near  storms,  and  in  the  tropics. 

John  Bane  (University  of  North  Carolina) 
showed  the  in^rtance  of  air-sea  fluxes  on  synop¬ 
tic-scale  atmospheric  and  mesoscale  ocean  phenom¬ 
ena.  In  particular,  he  used  data  and  numerical 
model  studies  to  show  how  cold  air  outbreaks  off 
the  U.S.  East  Coast  can  modify  underlying  sea- 
surface  and  upper-ocean  thermal  structure.  In  his 
presentation,  T.  Nishimura  (Science  Univemity  of 
Tokyo)  showed  related  processes  off  the  JI^}anese 
coast,  where  tc^graphy  is  an  important  element  in 
the  ocean  sea  surface  terqperature  pattern  evolu¬ 
tion. 

Alexander  Soloviev  (Shirshov  Institute)  also 
gave  two  presentations.  The  first  was  on 
parameterization  of  heat  and  gas  transfer  in  molec¬ 


ular  sublayers  at  the  ocean  surface.  This 
parameterization  was  based  on  a  surface  Richard¬ 
son  number,  which  controls  transition  from  con¬ 
vective  instabilit>’  to  wind-induced  instability  ("roll¬ 
ers "  on  breaking  waves)  and  the  Keulegan  number, 
which  controls  transition  from  the  rollers  to 
longwave  breaking.  In  the  second  presentation, 
Soloviev  reported  field  measurements  made  in  the 
tropical  Atlantic  and  in  the  Azores  frontal  zone  by 
using  a  high-resolution  temperature-conductivity 
sensor  mounted  on  the  bow  of  a  research  vessel  at 
l.S-2  m  depth.  These  data  reveal  sharpening  of 
the  windward  side  of  density  depression  pools 
(warm  and/or  freshened),  acconqianied  by  strongly 
asymmetrical  disturbances  and  sometimes  by  bursts 
of  short  wave  trains.  The  disturbances  seemingly 
result  from  development  of  internal  bores  in  buoy¬ 
ancy-driven  horizontal  spreading  of  the  density 
depression  pool  against  the  wind-driven  current. 
Strong  intensification  of  small-scale  fluctuations 
within  the  borelike  interfaces  provided  evidence  in 
favor  of  large  values  of  turbulent  entrainment 
fluxes,  which  have  ro  be  taken  into  account  in 
modeling  the  response  of  warm  or  freshened  pools 
to  wind  forcing. 

Sergey  Gulev  (State  Oceanography  Institute) 
described  results  of  an  innovative  analysis  of 
space-time  averaging  of  bulk  meteorological  and 
ocean  surface  quantities  (winds,  temperatures, 
humidity)  and  their  differences,  and  the  role  of 
averaging  in  flux  estimates.  Results  indicate  that 
regions  near  western  boundary  currents  require  1- 
hour  averages  to  accurately  determine  monthly 
fluxes  for  climate  studies.  Differences  between 
1 -month  averages  vs  the  average  of  hourly  obser¬ 
vations  can  result  in  errors  in  flux  estimates  ap¬ 
proaching  40  percent. 

A  most  interesting  presentation  on  studies  of 
inertial  resonance  produced  by  an  oceanic  jet  was 
given  by  Patrice  Klein  [Institut  Francais  de  Recher¬ 
che  pour  rBxploltation  de  la  Mer  (Brest)  French 
Ocean  Research  Institute  (IFAEMER),  Brest]. 

This  study  of  sub -mesoscale  jets  is  relevant  tc 
processes  occurring  at  the  edge  of  a  mesr  ^cale 
eddy  or  at  frontal  boundaries.  A  combination  of 
analytic  and  numerical  studies  illustrated  many 
interesting  nonlinear  phenomena,  including  the 
Doppler  shift  of  internal  waves  due  to  Ekman  drift, 
experimental  growth  or  decay  of  inertial  motions 
due  to  resonance  phenomena,  effects  of  turbulent 


536 


E8NIB  92-08 


entrainment,  and  vertical  wave  propagation  sub¬ 
stantially  below  the  mixed  layer  depth. 

Isaac  Ginis  [Geophysical  Fluid  Dynamics 
Laboratory  (GFDL),  National  Oceanic  and  Atmo¬ 
spheric  Administration  (NOAA)],  discussed  the 
coupling  between  air-sea  interactions  and  the  un¬ 
derlying  ocean  during  hurricanes.  He  used  an 
18-iayer,  triply  nested  movable  mesh,  hurricane 
model  in  the  atmosphere  coupled  to  a  7-layer  (1/6- 
deg  resolution)  primitive  equation  ocean  model  to 
show  how  coupling  influences  the  intensity  and 
motion  of  hurricanes.  His  results  indicate  that 
hurricanes  can  be  sensitive  to  cooling  of  the  .ssa 
surface,  with  the  intensity  being  reduced  lS-20 
percent  with  coupling  and  that  the  track  deviates 
from  the  uncoupled  case  only  for  slowly  moving 
storms.  These  results  need  to  be  verified  by  obser¬ 
vation. 

CONCLUDING  REMARKS 

Professor  Kraus  summarized  the  colloquium  in 
the  final  presentation.  He  suggested  that  there 
were  two  primary  motivations  for  the  meeting, 

•  to  understand  and  describe  air-sea  process¬ 
es,  and 

•  to  understand  and  parameterize  the  effects 
of  these  interactions  on  the  larger  scale. 

The  colloquium  clearly  paid  more  attention  to  the 
former  than  the  latter. 

The  colloquium  papers  fell  into  six  categories 
based  on  the  spatial  scales  under  study. 

Surface/skin  processes  at  the  small  end,  where  a 
number  of  papers  described  physical  modeling  of 
surface  renewal.  Turbulence  and  microstructure 
were  next,  where  considerable  Russian  data  was 
impressive  but  still  largely  unused  elsewhere. 

Waves  at  the  next  scale,  where  the  effects  ot  sea 
state  and  swell  on  drag  coe^icient,  was  actively 
discussed.  Next  were  Langmuir  cells,  where 
acoustic  instrumentation  is  making  these  phenome¬ 
na  more  clearly  observed  than  ever  before,  but  still 
not  better  understood.  Next  came  mixed  layers. 


where  heat  and  momentum  flux  are  transferred 
between  the  near-surface  layer  and  the  thermocline. 
The  largest  scale  phenomena  discussed  were  meso- 
scale  ocean  phenomena,  such  as  the  Gulf  Stream 
and  Kuroshio  Current,  that  are  affected  by  synop¬ 
tic-scale  atmospheric  forcing. 

The  proceedings  will  be  published  in  a  differ¬ 
ent  format  from  earlier  years,  now  being  a  dedicat¬ 
ed  issue  of  the  Journal  of  Marine  Systems,  for 
which  Prof.  Nihoul  serves  as  an  editor.  This  will 
replace  the  well-known  series  of  specially  printed 
colloquia  volumes.  This  change  will  entail  the 
refereeing  of  all  submissions,  and  this  is  good  and 
bad  news.  It  will  make  the  final  product  more 
uniform  in  appearance,  and  it  will  weed  out  the 
poorer  products,  but  this  namrally  will  significantly 
reduce  the  number  of  papers  appearing  in  print 
from  those  presented  at  the  meeting. 

SUMMARY 

One  measure  of  the  success  of  past  Internation¬ 
al  Liege  Colloquia  on  Ocean  Hydrodynamics  has 
been  the  broad  participation  that  they  have  attract¬ 
ed.  This  colloquium  was  particularly  successful  in 
that  regard,  attracting  scientists  from  14  countries 
of  North  America,  Europe,  Asia  and  Australia. 

This  participation  is  due  to  the  diligence  of  the 
conference  organizers  and  the  widespread  interest 
in  the  topic. 
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